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RNA polymerase I transcription in vivo in transiendy DNA-transfected cells has been used for expression of influenza vRNA molecules 
coding for chloramphenicol acetyltransferase (CAT) in anti-sense orientation. Influenza virus superinfection served to provide viral RNA 
polymerase and other proteins for transcriptional conversion of minus-strand vRNA into plus-strand viral mRNA molecules expressing 
CAT activity. This system has been used for an analysis via nucleotide exchanges as well as deletions and insertions of both terminal 
segments of the vRNA sequence which cooperatively constitute the vRNA promoter structure. Several mutants with greatly enhanced 
expression rates over wild-type levels have been constructed, which also can be packaged and serially passaged into progeny virus. The 
data obtained for the mutations in various promoter elements support a model of consecutive, doublestrand vRNA promoter structures in 
binding of viral polymerase and initiation of RNA synthesis. Preparations of attenuated influenza virus for vaccination purposes include a 
single recombinant segment with promoter up mutation(s) for over-expression of an own or foreign gene product, which at the same time 
because of its over-replication serves to decrease the number of helper virus RNP segments. The same viruses further have been passaged 
through a step of ribozyme cleavage acting at one of the helper viral segments, which will delete this vital function and structure with high 
rates from the virus progeny. The resulting attenuated viruses will interact with their target cells in only one round of abortive infection, 
and are unable to produce viral progeny. 
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AN ATTENUATED VACCINATION AND GENE-TRANSFER VIRUS, A 
METHOD TO MAKE THE VIRUS AND A PHARMACEUTICAL COMPO- 
5 SITION COMPRISING THE VIRUS 

The object of the present invention was to make a vaccination virus This 
objective has been fulfilled with the segmented virus constructed as described 
herein 

The genome of influenza A viruses consists of 8 different single-stranded viral 
RNA (vRNA) molecules of negative polarity, which have in common 5* and 3' 
terminal sequences largely complementary to each other. These conserved 
segments 13 and 12 nucleotides in length are known to form double-stranded RNA 
panhandle structures (Hsu et al , 1987; Fodor et al., 1993) which have been 
analysed in more detail recently in vitro using internally deleted model RNAs 
(Baudin et al., 1994, Tiley et al., 1994). In the virion the panhandle ends of all 
RNA segments are found in specific binding to viral RNA polymerase complexes, 
while the remaining internal segments stay single-stranded with viral nucleoprotein 
(NP) in cooperative binding (Compans et al., 1972, Honda et al., 1988; Martin et 
al., 1992). Upon infection these viral RNPs initially serve as templates for the 
synthesis of viral mRNAs by a specific cap-snatching mechanism (Plotch et al 
1979, Braam et al , 1983), and later on will direct synthesis of full-length 
complementary RNAs (cRNAs), probably dependent on the absence or presence of 
newly synthesized NP protein (Shapiro and Krug, 1988). The plus-strand cRNAs 
are then used as templates for progeny vRNA synthesis 

The viral RNA polymerase complex consisting of proteins PB1, PB2, and PA is 
involved in all three different modes of RNA synthesis during the viral replication 
cycle, following its specific binding to the terminal panhandle segments of both 
vRNAs and cRNAs. Sequence comparison reveals that the vRNA and cRNA 
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termini have similar, but not identical sequences. For that reason vRNA and 
cRNA recognition may be distinguished because of these structural alterations 
allowing for asymmetries in initiation of plus and minus strand RNA synthesis, 
and possibly in viral RNP packaging, which has also been suggested to be 
5 controlled by the panhandle RNA sequence (Hsu et al., 1987) 

Recentiy, we reported on an in vivo system for the introduction of specific 
mutations into the genome of influenza viruses: viral cDNA has been inserted in 
antisense orientation between mouse rDNA promoter and terminator sequences 
This has been derived from in vitro transcription experiments based on nuclear 

10 extracts from Ehrlich ascites cells, which resulted in transcripts exacdy resembling 
influenza vRNA For a series of in vivo studies, the viral coding sequence was 
replaced by the coding sequence for chloramphenicol-acetyltransferase (CAT), 
however, with both influenza terminal non-coding sequences being retained 
exactly on the resulting vRNA transcripts. After transfection of this recombinant 

15 DNA template into mouse cells followed by influenza virus infection, CAT 
activity was detectable Transfer of supernatants to different cells demonstrated 
that CAT-vRNAs transcribed in vivo by cellular RNA polymerase I were not only 
transcribed by viral RNA polymerase into plus-strand mRNA and translated into 
CAT protein, but also were replicated and packaged into infectious progeny virus 

20 particles (Zobel et al., 1993, Neumann et al., 1994). 

We have used this system for a stepwise introduction of single and multiple 
mutations into the conserved panhandle RNA sequence, thereby effectively 
converting the HA-vRNA promoter sequence into an HA-cRNA promoter 
sequence and vice versa For these series of constructs CAT activities have been 
25 measured both in primarily transfected and infected B82 cells and, after passaging 
of B82 supernatants, in secondarily infected MDCK cells. From the results 
obtained we propose a model for the terminal RNA sequence as being recognized 
RNA polymerase in consecutive steps of different structure when used as a 
template for initiation of viral mRNA synthesis. 

30 The present invention relates to a segmented . RNA virus which comprises one or 
more segments which have been genetically modified to show improved 
transcription, replication and/or expression rates 
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The virus can be one where one or more modifications have been introduced in 
the noncoding region(s) and/or one or more modifications have been introduced in 
the coding region(s). A possibility is that at least one modified segment is derived 
from an original one by sequence variation(s). It is also possible that at least one 
modified segment is an artificial addition to the set of original or modified original 
segments. The virus can be one wherein the modified segment comprises a 
nucleotide sequence which codes for a protein or peptide which is foreign to the 
original virus. Preferred is that the foreign protein or peptide constitutes an antigen 
or antigen-like sequence, a T-cell epitope or related sequence In such a case it is 
possible that the segment comprises repetitions of an antigen or epitope or other. 
Such an antigen or epitope can be for example derived from HIV, Herpes-Virus, 
Human Papilloma virus, Rhinovirus, CMV or Hog Cholera Virus (HCV) The 
virus of the present invention may be a single stranded negative-strand RNA virus 
as for example one of the Orthomyxoviridae family, the Bunyaviridae family or of 
the Arenaviridae family The most preferred virus is an influenza virus. The virus 
of the present invention can also be a double-stranded RNA virus, as for example 
a reovirus, a rotavirus or an orbivirus. The viruses of the present invention can be 
used in gen-therapy. 



The present invention also relates to the virus and use of the virus for the 
20 preparation of pharmaceuticals. 

Mutational analysis of vRNA 3' terminal sequence positions. 

Influenza A viral RNA 5' and 3' ends have similar, but not identical sequences 
with nucleotide mismatches at positions 3, 5 and 8, and an additional unpaired 
nucleotide is located at position 10 in the 5' region Nevertheless, both vRNA 

25 termini hybridize into a double stranded panhandle structure made up of twelve 
and thirteen nucleotides in common for all eight RNA segments, plus in average 
three additional basepairs specific for each of the vRNA molecules. Due to the 
deviations mentioned the cRNA or plus strand panhandle structures have to be 
different from the vRNA structures; however, both are recognized by viral RNA 

30 polymerase and are used for initiation of RNA synthesis, i.e. they are constituting 
a promoter structure. Even if in initial recognition and binding of RNA 
polymerase the double stranded RNA panhandle structure is known to be the 
substrate, and is also observed in virion RNPs (Hsu et al., 1987), for the initiation 
step of transcription at the 3' ultimate template position this terminal region has to 
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be separated into a partially single stranded, i.e. 'forked' structure (Fodor et al., 
1994). RNA polymerase may be predicted to continue its binding interaction with 
both, the remaining double stranded segment: nucleotides 10 to 15 versus 11 to 
16 1 , and to the single stranded 3' template segment: nucleotides 1 up to 9 , as well 
5 as the 5' single stranded end (Tiley et al., 1994). Introduction of mutations at 
specific positions in either strand may hence alterate simultaneously both of these 
consecutive vRNA promoter structures: panhandle and fork in different ways, and 
will in addition also result in corresponding variations of the cRNA promoter 
structure. 

10 To investigate the importance of the three mismatch positions, specific single, 
double or triple nucleotide exchanges were first introduced into the vRNA 3* end 
sequence at positions 1,5 and 8" , thereby approaching a fully double-stranded 
vRNA promoter structure, in a step-wise manner. At the same time the vRNA 3' 
end template sequence will become equivalent to the cRNA 3' end in these 
15 positions, but not in regard to the additional nucleotide at position 10. Single 
nucleotide exchanges according to this scheme (pHL1098, pHL1099, pHLHOO) 
abolished the promoter activity, and no CAT activity was observed, as has been 
reported before with a different method (Luo et al., 1993). Two of the double 
mutation constructs (pHLHOl, pHL1103) also gave negative results. 

20 In contrast, for pHL1102 (G3A, U8C) 1 a significant CAT activity was detected, 
distinctly higher than for the corresponding wild-type construct (pHL926;) which 
in the conditions applied (8 hr after infection) resulted in rather low levels of CAT 
expression. This activity increase is further enhanced for the final construct of this 
series carn/ing the triple exchange G3A, C5U and U8C (pHL1104), i e trans- 

25 fection of pHL1104 DNA followed by influenza virus infection resulted in a very 
high level of CAT expression, also considerably above the pHLl 102 results. 

These results have been repeated using various conditions of transfection and 
infection as well as determining kinetic data during the course of infection. While 
the pHL1104 variant is always observed far superior over any wild-type construct 



30 'Notations concerning nucleotides! to 15 refer to positions in the vRNA 3' end, 
e.g. position 2 designates the penultimate nucleotide; 5' end positions are given in 
ordinary numbers. The notation G3A describes a mutational change of guanosine 
to adenosine at position 3 
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that expression ratio may be variable and difficult to quantitate (between around 
20 fold and nearly 100 fold). Rather short infective cycles of eight hours as used 
prevalently appear to put more slowly replicating, i.e. wild-type molecules at a 
disadvantage, in particular in passaging of packaged pseudo-vRNA molecules via 
virus progeny, both is found increased for wildtype and related constructs after 
DNA transfection plus twelve hours of infection (see Neumann et al , 1994). 
Remaining deviations in CAT expression ratios may be attributed to variations in 
growth conditions in individual experiments 

Mutational analysis of vRNA 5' terminal sequence positions 

We also addressed the question whether the unexpectedly high viral mRNA 
expression rate of P HL1104 is the consequence of a stabilized panhandle double- 
strand structure or may be directly attributed to the point mutations introduced into 
the vRNA 3' sequence, and active when being used as a single-stranded template 
segment, e.g. in the 'forked' structure. 

For this purpose we constructed pHL1124, three complementary point mutations 
introduced at the 5' end of the vRNA sequence again in positions 3, 5 and 8 
(U3C, G5A, A8G). Together with a sequence wild-type vRNA 3' end these 
variations again result in a panhandle structure free of mismatches and, therefore, 
pHL 1 124 is equivalent in this regard to pHLl 104, but different in the sequence of 
its template and non-template single strands No significant CAT expression was 
detected for P HL1124 We conclude that the increased CAT activity of pHLl 104 
is not a consequence of the stabilized panhandle structure itself, but at least in part 
is a consequence of the individual nucleotide exchanges at positions T, 5 and 8 at 
the 3' end of the vRNA sequence, it is also more likely then to originate from 
other structural intermediates of initiation than a stabilized panhandle. 

Mutational analyses of concerted exchanges at both ends of the vRNA 
sequence 

In order to determine in detail the influence of single, double and triple exchanges 
at the vRNA 5' end upon CAT expression rates we also used the improved vRNA 
3' end sequences of pHL1104 and pHLl 102 as starting points rather than the 
corresponding wild-type sequence. From the series of experiments related to 
pHLl 104 and from the equivalent series related to pHL 1 102 it can be concluded 
that retaining a G residue in position 5 is the most important single feature in 
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these 5* end variations. A single exchange into an A residue at position 5 as in 
pHL1185 will render the promoter entirely inactive, while single exchanges in 
positions 3 or 8, as well as a 3 plus 8 double exchange will retain promoter 
activity even if reduced from the level observed for pHL1104, but still above 
5 wild-type expression rates While the G5A nucleotide substitution opposite 
nucleotide C5 in the 3' terminus results in losing one basepair (in the panhandle 
context) within the pHL1104 series, a basepair is indeed gained by exactly the 
same G5A exchange within the pHL1102 series, i.e. opposite the U5 residue as 
present in the pHL1102 vRNA 3' end. Since again the G5A exchange results in 
10 loss of promoter function inspite of gaining one basepair we conclude that the 
guanosine at position 5 may be important for RNA polymerase binding within the 
5' non-template single strand rather than being part of the panhandle double- 
stranded structure in this region. The importance of a G residue at this position 
has been shown earlier in a single-step mutational analysis (Li and Palese, 1992), 
15 while non-template strand binding of RNA polymerase has been studied recently 
in vitro (Tiley et aL, 1994). Different from the deleterious effect of an exchange at 
position 5 exchanges at positions 3 and in particular 8 are of minor importance 
The series of 5' nucleotide exchanges has also been repeated for the pHL1102 
version of the vRNA 3' end yielding exactly the same pattern of results, albeit at 
20 the somewhat reduced levels characteristic for pHL1102. The only result in both 
series not quite in agreement with a uniquely important role for a G-residue in 
position 5 is the triple exchange of pHL1126 which retains low promoter activity 
inspite of an A residue in that position. Due to altogether six concerted exchanges 
in positions 3, 5 and 8 as well as 3 ,5 and 8 from the 5' and 3' end of the 
25 vRNA sequence the pHL1126 vRNA panhandle structure is indeed nearly 
equivalent to a wild-type cRNA panhandle, with the exception of an unpaired 
adenosine being present in position 10 of pHL1126 while an unpaired uridine at 
position fO is part of the wild-type cRNA structure. This correlation may indicate 
a correct structure in pHL1126 for several other residues of (minor) importance 
30 which, therefore, apparently allows to compensate for the missing G residue in 
position 5, even if at a clearly reduced level of activity. - In the parallel pHL1102 
series the corresponding triple exchange clone pHL1125 does not show any 
promoter activity; however, because of its deviation at position 5 it does not 
completely resemble the cRNA panhandle structure 
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Mutational analysis or the panhandle bulge structure around nucleotide 10 

An extra, unpaired residue in position 10 at the 5' end is a specific feature of the 
influenza viral RNA panhandle structure. It is causing or at least enforcing a major 
bulge of the structure, together with unpaired residues at position 9, and might be 

5 part of a specific recognition element of that structure by viral RNA polymerase 
In order to investigate the importance of that particular structural feature, a further 
series of plasmid constructs has been initiated, again based on pHL1104 and its 3' 
terminal sequence as a reference. A perfectly matched RNA double-strand without 
any bulge has been achieved either by inserting an additional U residue in the 3' 

10 end sequence opposite A10 (pHL1140) or by deleting the A10 residue from the 5' 
sequence (pHL1152). Finally, a bulge of opposite direction was created in the 
panhandle structure of pHL1164 with an extra U residue in position 10 of the 
3 1 end, and position 10 deleted from the 5' end sequence While the latter two 
constructs proved inactive in the CAT assay, pHLlHO did show some promoter 

15 activity, albeit at a reduced level. We conclude from this result that a bulge in this 
region may not be recognized directly by viral RNA polymerase but may serve as 
a flexible joint between two more rigid structural elements that are involved in 
immediate contact with viral polymerase. The necessary RNA bending may also, 
but less efficiently be achieved in an A*U-basepaired structure like pHL1140, 

20 while the other two structures would not permit such type of interaction with RNA 
polymerase. This interpretation has also been substantiated in a further series of 
variations in this region 

Serial passaging of influenza virus carrying promoter mutants 

All previous experiments consisted of a first measurement of viral mRNA 
25 synthesis in DNA-transfected and infected B82 cells, followed by a second 
measurement of viral mRNA synthesis in infected MDCK cells, after passaging of 
progeny virus containing supernatants. CAT expression in infected cells upon viral 
passaging requires packaging of pseudo-viral vRNAs, in addition to new rounds of 
viral mRNA synthesis in those cells leading to CAT expression again. All viral 
30 promoter mutants analysed and found active in transfected and helper-infected B82 
cells also resulted in CAT expression after transfer, and consistently in equivalent 
ratios of activity. Packaging, therefore, cannot be correlated with any specific 
element in the vRNA promoter structure so far, and does not appear to be a 
limiting factor in constructing influenza virus mutants in this system While CAT 
35 expression after passaging in general appeared to be increased over the levels 
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before passaging this might have been simply the result of different cells being 
used for the first and second step of CAT analysis, with MDCK being superior to 
B82 cells in influenza mRNA synthesis and also in progeny yields. Therefore, 
several experiments of serial passage have been performed using pHL1104 derived 

5 influenza supernatants and others, in MDCK cells. In these serial passages, always 
done using aliquots of supernatants harvested eight hours after infection for further 
transfer, a stepwise increase of CAT expression is observed (Fig 2). Apparently 
the superior performance of viral RNA promoters carrying sequence deviations 
according to pHLl 104 is not only true for viral mRNA synthesis, but also for viral 

10 RNA replication. 

Therefore, mutant viral RNAs of this character become accumulated and 
effectively selected in further passaging, while packaging may be a neutral event 
in this regard, at least for the variants analysed here. 

Serial passaging extended 

15 During further passaging of supernatants the CAT containing influenza segment 
carrying the mutationally altered viral promotor sequences became accumulated in 
a stepwise manner in the population of progeny viruses. In order to demonstrate 
this effect on the level of individual viruses being transferred we isolated in three 
indepentent experiments 50 to 85 plaques each after a third round of passage on 

20 MDCK cells. Each cell lysate obtained for the individual plaques was assayed for 
CAT activity according to the standard protocol While in two of the experiments 
the fraction of CAT positive plaques was in the range of 4 to 8% (1 out of 50, 4 
out of 40 plaques) in one of these series this fraction amounted to 47% (19 of 40 
plaques). Both of these results demonstrate a substantial increase over the initial 

25 fraction of CAT-segment containing virus, which may be calculated to be in the 
range of 10" 5 or at most 10" 4 , and slight variations in the conditions of growth 
during three steps of transfer may precipitate to result in the observed differences 
of CAT positive plaques. While every CAT positive plaque demonstrates the 
amplification of nine (not eight) viral RNA segments present in the initially 

30 infected cell, this may have resulted from a single virus carrying nine or more 
RNA segments or from coinfection by two defective viruses able to complement 
each other. 
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Necessarily, accumulation of a pseudo-viral segment not contributing to viral 
growth will, in further steps, become lethal to viral growth, even if a majority of 
virions may contain an average of eleven rather than eight RNA segments (Hsu et. 
al. 1987). Packaging of viral RNA-segments based on a general packaging signal 
identical for all eight segments and realized via a specific interaction chain: vRNA 
panhandle structure - viral RNA polymerase - viral NP protein - viral Ml protein 
will reflect the pools of the various vRNA segments in infected cells, and 
therefore may be biased towards an RNA segment superior in replication and 
overrepresented in that pool. Biased replication and packaging will, however, lead 
to accumulation of lethal viral particles due to an imbalance between the eight (or 
nine) viral RNA segments. This prediction is borne out in continuing the viral 
passage of pHL1104 derived influenza supernatants beyond step three as 
exemplified in Fig. 2 While CAT expression based on transcription of the 
pHL!104 derived pseudoviral RNA segments is increased further up to the fifth 
passage the number of viable viruses reaches a maximum already after the second 
step of viral passaging, thus demonstrating the continuous accumulation of an 
over-replicated foreign segment, based on a superior panhandle sequence. 

At a stage representing the third or fourth passage as displayed in Fig. 2 a vims 
preparation obtained in this way can be regarded as the equivalent of an attenuated 
viral strain While the concentration of attenuated virus particles that can be 
achieved in this way may appear to be Jimited a stage equivalent to passage 4 in 
Fig. 2 may be delayed upon coinfection with wildtype helper virus during first or 
second steps of transfer, and considerably increased concentrations of attenuated 
virus preparations might be achievable in this way 

pHLl 104-mediated high-rate expression of foreign proteins can also be used (after 
two or more steps of amplification via serial passaging on MDCK cells) for high 
rate synthesis of foreign proteins in embryonated chicken eggs, following a 
general method of preparation of viral stocks as used for influenza and other 
viruses, i. e. injection of virus suspensions into the yolk sac. Protein preparations 
isolated from those embryonated and infected cells will be glykosylated and 
modified in other ways according to their origination from eukaryotic cells. 

A second method of influenza virus attenuation has been achieved via cleavage of 
either one of the influenza viral RNAs, preferrably the M or NP gene (segments 7 



- 9 - 



WO 96/10641 




PCT/EP95/03663 



or 5), via ribozyme hydrolysis in a specialised mode of action. The ribozyme 
RNAs which may be covalently inserted into the pSV2neo early mRNA, located 
between the neomycin resistance gene and the small t intron sequence originating 
from SV40 viral DNA, or expressed from similar expression cassettes, are directed 
5 against the 5' end sequence of segment 7 (or another of the influenza vRNA 
segments) 

During initiation of mRNA synthesis the 5' terminal sequence which is involved in 
formation of the panhandle structure is at first covered by viral RNA polymerase 
in association with that double stranded promoter region. It will, however, become 

10 single-stranded and free of protein, since the polymerase molecule will start 
transcription at the 3' end and move along the 3' template sequence while 
synthezising a perfectly hybridizing 5' RNA daughter strand, superior in that 
regard to the parental panhandle 5 1 segment. Ribozyme RNAs which may be 
inhibited in their activity either by RNA substrates involved in double strand 

15 formation or if RNA substrates are covered by protein, have been directed with a 
3* complementary sequence towards that protein-free 5' sequence of the substrate 
vRNA molecule for initiation of hybridization, which then will be extended across 
the entire complementary region of approximately 100 nucleotides i. e. well into 
the vRNA sequence initially covered by NP protein. 

20 A second feature of the ribozyme RNAs as applied for inactivation of influenza 
vRNA molecules is their double-unit hammerhead character, directed against not 
one, but two close GUY cleavage sites, e. g. GUU 16 and GUU 36 in segment 7 or 
GUC 30 and GUC 4g in segment 5, which are also known to be invariable in 
sequence comparisons of influenza isolates 

25 Both features of anti-influenza ribozymes as pointed out contribute to a reduction 
of typically two logs (up to three logs) in production of viral progeny in template 
ribozyme DNA transfected cells as compared to infection of mock transfected 
cells, both at moi 1 and 20 h after Lipofectamin-DNA treatment. Ribozyme 
treatment can be applied after two or three rounds in MDCK cells of pHL1104- 

30 promoted amplification of a pseudo-viral RNA segment originating from RNA- 
Polymerase I transcription, in the presence of helper virus as used in initial 
superinfection. 

- 10 - 
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In a simple version ribozyme treatment as described above is employed as a 
selection technique. Here, its application is appropriate if the pseudoviral RNA is 
indeed a (foreign) influenza segment carrying particular mutations but capable in 
principle to act as a functional substitute for the helper viral segment destroyed by 
5 ribozyme cleavage. For that purpose the substitute viral segment to be selected in 
that procedure has to be mutagenized in advance at the two cleavage sites 
indicated above in order to become resistant against ribozyme interference. In 
another application ribozyme cleavage of helpervirus vRNA can be used for 
attenuation of recombinant influenza virus preparations. Here, the pseudo-viral 

10 RNA segment may be designed in a way which renders it incapable to substitute 
for a helpervirus gene. Therefore, viral passaging into ribozyme template DNA 
transfected cells would lead to an abortive infection only, because of ribozyme 
mediated destruction of an important viral gene, if its gene product would not be 
added for complementation via expression from a cDNA construct which is also 

15 DNA-transfected into the cell together with ribozyme-expressing DNA 20 h before 
viral infection. In this way viral progeny is obtained that is attenuated because of 
ribozyme cleavage of one of the vRNA segments, and effectively that segment is 
missing in the virions because it can no longer be packaged Viral preparations 
obtained in this way are capable of only one round of infection because of their 

20 inherent Ml + M2 protein complementation, and therefore are suited for 
vaccination purposes. Animal infection with progeny virus as isolated after the 
ribozyme attenuation step results in. abortive infection, but viral proteins 
synthesized in infected cells are able to induce B-cell and T-cell responses in such s 
animals 

25 In influenza viral RNA synthesis parental negative-strand vRNA is copied into 
plus-strand cRNA, which again is copied into progeny vRNA, from the first to the 
last nucleotide. This amplification of viral RNAs, however, proceeds in an 
inherently asymmetric way, since vRNA molecules are synthesized in excess over 
cRNA molecules. This result is consistent with the idea that cRNA carries a 

30 promoter structure more active in binding viral RNA polymerase and in initiation 
of RNA synthesis, i.e. 'stronger* than does vRNA While at first simply the two 
3' ends of single-stranded vRNA and cRNA templates have been implicated as 
promoter sequences, the detection of double-stranded panhandle structures 
involving both ends of the vRNA sequence in virions (Hsu et al., 1987) suggested 

35 more complicated substrates for RNA polymerase binding and initation of 
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daughter-strand synthesis. A slightly different panhandle structure has also been 
observed with model vRNA molecules in the absence of viral proteins in vitro 
(Baudin et al., 1994), possibly calling for a structural change upon viral RNA 
polymerase binding, i.e. a bulge may be shifted from position 4 to position 10 in 

5 that reaction (see Fig. 1). While originally several of the RNA polymerase / vRNA 
binding experiments in vitro appeared to show recognition only of 3' end 
oligonucleotides, this has since been shown to be an artifact after pure, 
recombinant viral polymerase free of residual RNA became available, instead of 
enzyme preparations from virions. Under these conditions RNA polymerase 

10 binding to viral RNA as well as endonucleolytic cleavage of cellular mRNAs by 
subunit PB2 was observed to depend on vRNA 5' plus 3' terminal sequence 
binding, with even higher affinity for the 5' non-template segment (Hagen et al., 
1994, Tiley et al., 1994). 

Different from the employment of both vRNA and cRNA promoter structures in 

15 replication physiologically only vRNA promoters will also serve in initiation of 
viral mRNA synthesis according to the cap-snatching mechanism (Plotch et al., 
1979, Braam et al, 1983) While it has been claimed that cRNA promoters would 
not have the capacity to act according to this scheme (Tiley et al., 1994), the 
failure to observe viral antisense mRNA molecules may simply reflect the 

20 inavailability of cRNA molecules early in infection, i.e. in the absence of surplus 
viral NP protein, and small amounts of such molecules might even have gone 
undetected. In this invention we describe a mutagenizational analysis of the vRNA 
promoter structure in vivo which in approaching the structure of the cRNA 
promoter via three nucleotide exchanges shows considerably improved activity in 

25 viral mRNA synthesis over vRNA promoter wild-type levels. Continuing increase 
of viral CAT mRNA expression during consecutive steps of viral passaging 
suggests that the same vRNA promoter mutants also show increased activity in 
cRNA synthesis, both in accordance with the idea that the cRNA promoter 
structure might be 'stronger' than the vRNA promoter, also in initiation of viral 

30 mRNA synthesis 

Additional variations of the 5' terminal sequence clearly indicate the major 
importance of a G residue in position 5, irrespective of complementarity or not to 
position 5 at the 3' end. The unique role of this G residue has been observed 
before in a serial mutagenizational analysis (Li and Palese, 1992) According to 
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both data guanosine residue 5 may be involved in single-strand binding of RNA 
polymerase as has indeed been observed for the non-template strand terminal 
segment (Tiley et al., 1994). While panhandle double-strand structures are likely to 
constitute the initial RNA polymerase binding substrate a partial separation of 
template and non-template strands is expected to take place consecutively resulting 
in a 'forked structure' such as proposed by Fodor et al. (1994). Specific and tight 
binding of RNA polymerase in this structure may predominantly be oriented 
towards sequence elements in the non-template strand, since the growing point of 
RNA synthesis will have to move along the entire template strand following its 
initiation. It is, therefore, possible that such a binding interaction survives most or 
all of an individual round of mRNA synthesis as has been proposed (Tiley et al 
1994). 

The triple nucleotide exchanges as introduced in vRNA molecules derived from 
pHL1104 templates will create three additional basepairs able to stabilize the 

15 panhandle structure in general, but more specifically they will favor a bulged 
adenosine 10 over the bulged adenosine 4 conformation as observed for the wild- 
type sequence in vitro (Baudin et al., 1994) Since the changes mtroduced here 
lead to a considerable enhancement of promoter activity we propose that a bulged 
10 conformation may be the structure underlying the vRNA / polymerase binding 

20 reaction, which otherwise would have to be achieved only as a result of that 
interaction. A bulged 10 adenosine residue may constitute a kind of flexible joint 
or angular kink which in turn suggests two major, structurally stable binding sites 
to the left and right of this element. One of these sites has to be the double- 
stranded sequence element of (in average) six basepairs extending from positions 

25 11 to 16 and 10 to 15 , respectively. While the distal three basepairs are known 
to be variable for the various RNA segments, basepair 13/12 has been shown to be 
exchangeable experimentally, and also the number of basepairs has been reduced 
to four without complete loss of function (Luo et al., 1991) With all of these data 
it seems clear that the main recognition element in this region is an RNA double- 

30 strand of certain stability, while it remains possible that residue 12 guanosine and 
potentially others are also recognized individually within that structure A major 
second binding element for RNA polymerase on the other side relative to position 
10 is less evident, but may be located in a distance of nearly one helical turn in 
the de-bulged region around position 4, since direct contacts are suggested by that 

35 initial conformational interaction, and also by the specific requirement of a 
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guanosine residue in position 5, which is likely to interact not only during, but 
also before partial strand separation in that region, i.e in the panhandle as well as 
the forked structure. While an extra adenosine residue in position 10 may be 
optimal for creating a correctly shaped bulge in this region of RNA, structural 
5 variants are possible in this regard (see pHL 1140) which excludes direct 
interactions between RNA polymerase and residues constituting that bulge. 

In summary we are proposing a model (see Fig 1) of consecutive steps of 
interaction between a vRNA or cRNA promoter structure and viral RNA 
polymerase: 

10 bulged 4 panhandle -> bulged 10 panhandle / polymerase -> forked RNA / 
polymerase (bound to 5G and ds element 11-16) -» initiation of RNA synthesis 
(recognition of 3' end of template) 

Attenuation of influenza viruses for preparation of a live nasal vaccine relies on 
two mechanisms: 1) preferential amplification of a recombinant viral segment 
15 carrying the pHL1104 promoter mutation, which will increase its rate in packaged 
viral RNP particles and indirectly decrease that of the eight helper virus RNP 
particles This competition results in an increase of defective viruses from which 
one or more of the regular viral gene segments are missing. 2) Sequence specific 
ribozyme cleavage of one or more helper virus RNA segments, if compensated 
20 through gene product expression for functional complementation. This dual 
interaction will result in virus progeny, which is capable of only one round of 
infection, abortive because of the missing viral protein(s) that are required for their 
propagation. - Ribozyme cleavage of one out of two sister viral gene segments, 
sensitive and (artificially) insensitive for its hydrolysis may also be used 
25 (repeatedly) for selection purposes, including selection for viral gene constructs 
expressed via RNA polymerase I transcription. 

MATERIALS AND METHODS 

Plasmid constructions 

Plasmids with mutated vRNA and/or mutated cRNA promoter sequences are 
30 derivatives of pHL926 (Zobel et al., 1993; Neumann et al., 1994). In pHL926 a 
hybrid CAT cDNA with flanking non-coding sequences derived from influenza 
vRNA segments has been precisely inserted in anti sense orientation between 
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mouse rDNA promoter and terminator sequences. The CAT reporter gene in this 
way has been introduced by exactly replacing the coding sequence for 
hemagglutinin, retaining the untranslated viral 5' and 3' sequences of segment 4 

vRNA 5' end mutations were created by PCR, using a general primer hybridizing 
5 to a position in the flanking rDNA promoter sequence, and a specific primer 
carrying the desired nucleotide substitution to be introduced in the viral terminal 
sequence The polymerase chain reaction products were first digested by the 
restriction enzymes Bglll and Spel, inserted into the left boundary position by 
exchanging the segment between these appropriate restricton sites in pHL926, and 
10 finally confirmed in their constitution by DNA Sanger sequencing 

Generation of vRNA 3* end mutations followed the same general scheme at the 
right boundary. PCR products were obtained by using a general primer 
complementary to a CAT gene internal sequence position, and a specific primer 
with appropriate nucleotide exchanges inserted into its sequence Following 
15 digestion with restriction enzymes Ncol and Seal the PCR products were cloned 
into Ncol- and Scal(partially)-digested plasmid pHL926 Any PCR derived 
sequences were investigated by DNA sequencing 

For constructs with both 5* end and 3' end mutations in combination, 5' variation 
containing fragments were obtained by Bglll and Spel restriction and inserted into 
20 the appropriate 3' terminal variation plasmids 

Cells and viruses 

Influenza A/FPV/Bratislava viruses were grown in NIH3T3 cells For transfection 
and passaging experiments B82 cells (a mouse L cell line) and MDCK cells were 
used 

25 Lipofectamin DNA transfection and influenza virus helper infection 

For DNA transfection 10 7 B82 cells were used 5 |ig of plasmid DNA were mixed 
with 60 \ig of Lipofectamin (Lipofectamin™, GIBCO/BRL) in serum-free medium 
and incubated at room temperature for 10-15 min This mixture was added to the 
cells washed twice with serum-free medium, and the incubation with 
30 Lipofectamin/DNA was continued for 1 hr. After further incubation with DMEM 
medium for 1 hr the transfected B82 cells were infected with influenza 
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A/FPV/Bratislava at a multiplicity of infection of 0.01 to 1 for another 30-60 min. 
Further incubation was performed with DMEM medium 

Passaging of virus containing supernatants 

Under standard conditions 8 hr after influenza infection (at moi 0.1 to 1) cells 
5 were harvested for CAT assays, and supernatants were collected and spun down at 
1200 rpm for 5 min for removal of cell debris 

Aiiquots of virus containing cleared supernatants were used for plaque tests, and 
another aliquot was adsorbed to 10 7 MDCK cells for 30-60 min for further 
passaging Again 8 hr after infection the CPE was verified, and cells and 
10 supernatants were collected and treated as before. 

CAT assay 

Cell extracts were prepared as desribed by Gorman et al. (1982) CAT assays were 
done with [ 14 C]chlorarnphenicol or fluorescent-labeled chloramphenicol 
(borondipyrromethane difluoride fluorophore; FLASH CAT Kit, Stratagene) as 
15 substrates 

For [ l4 C]chloramphenicol the assay mixture contained: 0 1 pCi 
[ ,4 C]chloramphenicol, 20 pi 4 mM Acetyl-CoA, 25 pal 1 M Tris-HCl (pH 7.5) and 
50 pi of cell lysate in a total volume of 150 pi. The assay mixture for the 
fluorescent-labeled substrate contained (in a final volume of 80 pi): 10 pi 0.25 M 
20 Tris-HCl (pH 7.5), 10 pi 4 M Acetyl-CoA, 10 pi fluorescent-labeled 
chloramphenicol, and 50 pi of cell lysate. After an incubation time of 16 hr the 
reaction products were separated by chromatography and either autoradiographed 
or visualized by UV illumination and photography 
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LEGENDS TO FIGURES 

Fig. 1. Proposed scheme of consecutive conformational steps occurring prior to 
initiation of viral mRNA synthesis in influenza vRNA, in wildtype and pHL1104 
derived mutant sequences. Positions of triple mutation in pHL1104 vRNA are 
25 indicated in bold and larger size letters. 

(A) Free RNA panhandle structure, bulged at position 4 (wildtype vRNA; Baudin 
et al, 1994) or at position 10 (mutant vRNA). (B) Bulged 10 panhandle structures 
after binding of viral RNA polymerase, proposed protein binding positions marked 
by underlignments (C) Forked structures of partial strand separation. (D) Initiation 
30 of viral mRNA synthesis via hybridization of capped primer oligonucleotide 

Fig. 2. Serial passaging of pHL 1 104-derived progeny viruses 
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10 7 B82 cells were transfected with 5 |a of pHL 1104 DNA (in 60 
Lipofectamin) and infected two hours later with influenza A/FPV/Bratislava 
(m.o.i:l). 8 hr post-infection the cells were assayed for plaque forming units (dark 
column) and for CAT activity (hatched column) After sedimentation an other 

5 aliquot of the supernatant was adsorbed to 10 7 MDCK cells. Further rounds of 
passaging were done equivalently by harvesting the cells 8 hr after infection for 
assaying CAT activities, whereas an aliquot of the supernatant was always 
adsorbed to fresh MDCK cells. Numbers of serial passages are indicated at the 
bottom Plaque forming units per ml refer to the left ordinate, CAT expression 

10 rates (relative to primary imfection levels) to the ordinate on the right. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

<i) APPLICANT: 

(A) NAME: Bayer AG 

(B) STREET: Bayerwerk 

(C) CITY: Leverkusen 

<E) COUNTRY: Deutechland 

(F) POSTAL CODE (ZIP) : 51368 

(G) TELEPHONE: (0)214-3061455 

(H) TELEFAX: (0)214-303482 



(ii) TITLE OF INVENTION: Vaccination virus, method of making it and 
pharmaceutical composition comprising that virus 

(iii) NUMBER OF SEQUENCES: 8 

15 (iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC - DOS /MS - DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30B (EPO) 

20 (2) INFORMATION FOR SEQ ID NO: 1: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE : DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Influenza virus, RNA sequence 
(C) INDIVIDUAL ISOLATE: pHL926 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CTGGCGGCAG TAGCGCGGTG GTCCCACCTG ACCCCATGCC GAACTCAGAA GTGAAACGCC 6 0 

GTAGCGCCGA TGGTAGTGTG GGGTCTCCCC ATGCGAGAGT AGGGAACTGC CAGGCATCAA 12 0 

35 ATAAAACGAA AGGCTCAGTC GAAAGACTGG GCCTTTCGTT TTATCTGTTG TTTGTCGGTG 13 0 

AACGCTCTCC TGAGTAGGAC AAATCCGCCG GGAGCGGATT TGAACGTTGC GAAGCAACGG 24 0 
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CCCGGAGGGT GGCGGGCAGG ACGCCCGCCA TAAACTGCCA GGCATCAAAT TAAGCAGAAG 300 

GCCATCCTGA CGGATGGCCT TTTTGCGTTT CTACAAACTC TTTTGTTTAT TTTTCTAAAT 360 

ACATTCAAAT ATGTATCCGC TCATGAGACA ATAACCCTGA TAAATGCTTC AATAATATTG 4 20 

AAAAAGGAAG AGTATGAGTA TTCAACATTT CCGTGTCGCC CTTATTCCCT TTTTTGCGGC 4 80 

5 ATTTTGCCTT CCTGTTTTTG CTCACCCAGA AACGCTGGTG AAAGTAAAAG ATGCTGAAGA 54 0 

TCAGTTGGGT GCACGAGTGG GTTACATCGA ACTGGATCTC AACAGCGGTA AGATCCTTGA 6 00 

GAGTTTTCGC CCCGAAGAAC GTTTTCCAAT GATGAGCACT TTTAAAGTTC TGCTATGTGG 66 0 

CGCGGTATTA TCCCGTGTTG ACGCCGGGCA AGAGCAACTC GGTCGCCGCA TACACTATTC 72 0 

TCAGAATGAC TTGGTTGAGT ACTCACCAGT CACAGAAAAG CATCTTACGG ATGGCATGAC 78 0 

10 AGTAAGAGAA TTATGCAGTG CTGCCATAAC CATGAGTGAT AACACTGCGG CCAACTTACT 84 0 

TCTGACAACG ATCGGAGGAC CGAAGGAGCT AACCGCTTTT TTGCACAACA TGGGGGATCA 90 0 

TGTAACTCGC CTTGATCGTT GGGAACCGGA GCTGAATGAA GCCATACCAA ACGACGAGCG 96 0 

TGACACCACG ATGCCTGCAG CAATGGCAAC AACGTTGCGC AAACTATTAA CTGGCGAACT 102 0 

ACTTACTCTA GCTTCCCGGC AACAATTAAT AGACTGGATG GAGGCGGATA AAGTTGCAGG 108 0 

15 ACCACTTCTG CGCTCGGCCC TTCCGGCTGG CTGGTTTATT GCTGATAAAT CTGGAGCCGG 114 0 

TGAGCGTGGG TCTCGCGGTA TCATTGCAGC ACTGGGGCCA GATGGTAAGC CCTCCCGTAT 1200 

CGTAGTTATC TACACGACGG GGAGTCAGGC AACTATGGAT GAACGAAATA GACAGATCGC 126 0 

TGAGATAGGT GCCTCACTGA TTAAGCATTG GTAACTGTCA GACCAAGTTT ACTCATATAT 13 2 0 

ACTTTAGATT GATTTAAAAC TTCATTTTTA ATTTAAAAGG ATCTAGGTGA AGATCCTTTT 1380 

20 TGATAATCTC ATGACCAAAA TCCCTTAACG TGAGTTTTCG TTCCACTGAG CGTCAGACCC 144 0 

CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT 1500 

GCAAACAAAA AAACCACCGC TACCAGCGGT GGTTTGTTTG CCGGATCAAG AGCTACCAAC 156 0 

TCTTTTTCCG AAGGTAACTG GCTTCAGCAG AGCGCAGAT A CCAAATACTG TCCTTCTAGT 162 0 

GTAGCCGTAG TTAGGCCACC ACTTCAAGAA CTCTGTAGC A CCGCCTACAT ACCTCGCTCT 1680 

25 GCTAATCCTG TTACCAGTGG CTGCTGCCAG TGGCGATAAG TCGTGTCTTA CCGGGTTGGA 174 0 

CTCAAGACGA TAGTTACCGG ATAAGGCGCA GCGGTCGGGC TGAACGGGGG GTTCGTGCAC 18 00 

ACAGCCCAGC TTGG AGCGAA CGACCTACAC CGAACTGAGA TACCTACAGC GTGAGCATTG I86 0 
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AGAAAGCGCC 
CGGAACAGGA 
TGTCGGGTTT 
GAGCCTATGG 
5 TGGCGGACGC 
AGAATTGATT 
CATTCAGGTC 
GTATAGGGCG 
AATCGCCGTG 

10 TCCTTGAAGC 
GGGCATCCCG 
CGTCGCGAAC 
TTACATTAAT 
TGCATTAATG 

15 GTTTTTCTTT 
GAGAGTTGCA 
GTGGTTGACG 
ATATCCGCAC 
TGATCGTTGG 

20 TGTTGAAAAC 
TTGCGAGTGA 
GGGCCCCCGG 
GATCCAAAGC 
CTTACACATC 

25 TTACGTTGAC 
AGAGAGTCAA 
TGCCGGTGTC 



ACGCTTCCCG AAGGGAGAAA 

GAGCGCACGA GGGAGCTTCC 

CGCCACCTCT GACTTGAGCG 

AAAAACGCCA GCAACGCGGC 

GATGGATATG TTCTGCCAAG 

GGCTCCAATT CTTGGAGTGG 

GAGGTGGCCC GGCTCCATGC 

GCGCCTACAA TCCATGCCAA 

ACGATCAGCG GTCCAGTGAT 

TGTCCCTGAT GGTCGTCATC 

ATGCCGCCGG AAGCGAGAAG 

GCCAGCAAGA CGTAGCCCAG 

TGCGTTGCGC TCACTGCCCG 

AATCGGCCAA CGCGCGGGGA 

TCACCAGTGA GACGGGCAAC 

GCAAGCGGTC CACGCTGGTT 

GCGGGATATA ACATGAGCTG 

CAACGCGCAG CCCGGACTCG 

CAACCAGCAT CGCAGTGGGA 

CGGACATGGC ACTCCAGTCG 

GATATTTATG CCAGCCAGCC 

TGACAGGGAC AGAGAGGGCT 

TCCAGGGCGA GCTCGAATTC 

CCAGCCCTGA AAAAGGGCAT 

ACCATCGAAT GGTGCAAAAC 

TTCAGGGTGG TGAATGTGAA 

TCTTATCAGA CCGTTTCCCG 



GGCGGACAGG TATCCGGTAA 

AGGGGGAAAC GCCTGGTATC 

TCGATTTTTG TGATGCTCGT 

CCGAGATGCG CCGCGTGCGG 

GGTTGGTTTG CGCATTCACA 

TGAATCCGTT AGCGAGGTGC 

ACCGCGACGC AACGCGGGGA 

CCCGTTCCAT GTGCTCGCCG 

CGAAGTTAGG CTGGTAAGAG 

TACCTGCCTG GACAGCATGG 

AATCATAATG GGGAAGGCCA 

CGCGTCGGCC AGCTTGCAAT 

CTTTCCAGTC GGGAAACCTG 

GAGGCGGTTT GCGTATTGGG 

AGCTGATTGC CCTTCACCGC 

TGCCCCAGCA GGCGAAAATC 

TCTTCGGTAT CGTCGTATCC 

GTAATGGCGC GCATTGCGCC 

ACGATGCCCT CATTCAGCAT 

CCTTCCCGTT CCGCTATCGG 

AGACGCAGAC GCGCCGAGAC 

TCTGGAGGAA AAAAGAAAAA 

CCCGGTAAAG CCGCTTAAGA 

CAAAATAAAC CACACCTATG 

CTTTCGCGGT ATGGCATGAT 

ACCAGTAACG TTATACGATG 

CGTGGTGAAC CAGGCCAGCC 



GCGGCAGGGT 1920 

TTTATAGTCC 198 0 

CAGGGGGGCG 2 04 0 

CTGCTGGAGA 2 100 

GTTCTCCGCA 2 16 0 

CGCCGGCTTC 222 0 

GGCAGACAAG 22 8 0 

AGGCGGCATA 2 34 0 

CCGCGAGCGA 24 00 

CCTGCAACGC 24 6 0 

TCCAGCCTCG 2 52 0 

TCGCGCTAAC 2 58 0 

TCGTGCCAGC 2 64 0 

CGCCAGGGTG 2 70 0 

CTGGCCCTGA 276 0 

CTGTTTG ATG 2 8 2 0 

CACTACCGAG 2 88 0 

CAGCGCCATC- 2 94 0 

TTGCATGGTT 3 000 

CTG AATTTG A 3 0 6 0 

AGAACTTAAT 3 12 0 

AAAAAAAAAA 3 18 0 

CATTCCCGCT 3 24 0 

GTGTATGCAT 3 300 

AGCGCCCGGA 336 0 

TCGCAGAGTA 34 2 0 

ACGTTTCTGC 34 8 0 
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GAAAACGCGG GAAAAAGTGG AAGCGGCGAT GGCGGAGCTG AATTACATTC CCAACCGCGT 3 54 0 

GGCACAACAA CTGGCGGGCA AACAGTCGTT GCTGATTGGC GTTGCCACCT CCAGTCTGGC 36 00 

CCTGCACGCG CCGTCGCAAA TTGTCGCGGC GATTAAATCT CGCGCCGATC AACTGGGTGC 3 66 0 

CAGCGTGGTG GTGTCGATGG TAGAACGAAG CGGCGTCGAA GCCTGTAAAG CGGCGGTGCA 3 72 0 

5 CAATCTTCTC GCGCAACGCG TTCAGTGGGC TAGATCTACC GCGGTAGATC ATTAACTATC 3 78 0 

CGCTGGATGA CCAGGATGCC ATTGCTGTGG AAGCTGCCTG CACTAATGTT CCGGCGTTAT 3 84 0 

TTCTTGATGT CTCTGACCAG ACACCCATCA ACAGTATTAT TTTCTCCCAT GAAGACGGTA 3900 

CGCGACTGGG CGTGGAGCAT CTGGTCGCAT TGGGTCACCA GCAAATCGCG CTGTTAGCGG 3 96 0 

GCCCTGTACG TCTGAGGCCG AGGGAAAGCT ATGGGCGCGG TTTTCTTTCA TTGACCTGTC 4 02 0 

10 GGTCTTATCA GTTCTCCGGG TTGTCAGGTC GACCAGTTGT TCCTTTGAGG TCCGGTTCTT 4 08 0 

TTCGTTATGG GGTCATTTTT GGGCCACCTC CCCAGGTATG ACTTCCAGGT ATTCTCTGTG 4 14 0 

GCCTGTCACT TTCCTCCCTG TCTCTTTTAT GCTTGTGATC TTTTAGATCT GGTCCTATTG 42 00 

GACCTGGAG A TAGGTAGTAG AAACAAGGGT GTTTTTAAAT ACTAGTACAT TACGCCCCGC 426 0 

CCTGCCACTC ATCGCAGTAC TGTTGTAATT CATTAAGCAT TCTGCCGACA TGGAAGCCAT 4 32 0 

15 CACAGACGGC ATGATGAACC TGAATCGCCA GCGGCATCAG CACCTTGTCG CCTTGCGTAT 43 8 0 

AATATTTGCC CATGGTGAAA ACGGGGGCGA AGAAGTTGTC CATATTGGCC ACGTTTAAAT 44 4 0 
CAAAACTGGT GAAACTCACC CAGGGATTGG CTGAGACGAA AAACATATTC TCAATAAACC 4 5 00 

CTTTAGGGAA ATAGGCCAGG TTTTCACCGT AACACGCCAC ATCTTGCGAA TATATGTGTA 4 56 0 

GAAACTGCCG GAAATCGTCG TGGTATTCAC TCCAGAGCGA TGAAAACGTT TCAGTTTGCT 4 62 0 

20 CATGGAAAAC GGTGTAACAA GGGTGAACAC TATCCCATAT CACCAGCTCA CCGTCTTTCA 468 0 

TTGCCATACG GAATTCCGGA TGAGCATTCA TCAGGCGGGC AAGAATGTGA ATAAAGGCCG 474 0 

GATAAAACTT GTGCTTATTT TTCTTTACGG TCTTTAAAAA GGCCGTAATA TCCAGCTGAA 4 80 0 

CGGTCTGGTT ATAGGTACAT TGAGCAACTG ACTGAAATGC CTCAAAATGT TCTTTACGAT 4 86 0 

GCCATTGGGA TATATCAACG GTGGTATATC CAGTGATTTT TTTCTCCATG ATTAATAGAA 4 92 0 

25 TTATCCCCTG TTTCTACTCC CCCCCAACTT CGGAGGTCGA CCAGTACTCC GGGCGAAACT 4 98 0 

TTGTTTTTTT TTTTTCCCCC GATGCTGGAG GTCGACCAGA TGTCCGAAAG TGTCCCCCCC 5 04 0 

CCCCCCCCCC CCCGGCGCGG AACGGCGGGG CCACTCTGGA CTCTTTTTTT TTTTTTTTTT 5100 
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TTTTTTTTTG GGGATCCTCT AGAGTCGACC TGCAGCCCAA GCTAGCGGCC GCTAGCTTCT 516 0 

GTTTTGGCGG ATGAGAGAAG ATTTTCAGCC TG AT AC AG AT T AAAT C AG A A CGCAGAAGCG 52 2 0 

GTCTGATAAA ACAGAATTTG C 5 24 1 
(2) INFORMATION FOR SEQ ID NO : 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Influenza virus, RNA sequence 
15 (C) INDIVIDUAL ISOLATE: pHL1104 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CTGGCGGCAG TAGCGCGGTG GTCCCACCTG ACCCCATGCC GAACTCAGAA GTG AAACGCC 6 0 

GTAGCGCCGA TGGTAGTGTG GGGTCTCCCC ATGCGAGAGT AGGGAACTGC CAGGCATCAA 12 0 

ATAAAACGAA AGGCTCAGTC GAAAGACTGG GCCTTTCGTT TTATCTGTTG TTTGTCGGTG 18 0 

AACGCTCTCC TGAGTAGGAC AAATCCGCCG GGAGCGGATT TGAACGTTGC GAAGCAACGG 24 0 

CCCGG AGGGT GGCGGGCAGG ACGCCCGCCA TAAACTGCC A GGCATCAAAT TAAGCAGAAG 300 

GCCATCCTGA CGG ATGGCCT TTTTGCGTTT CTACAAACTC TTTTGTTTAT TTTTCTAAAT 36 0 

AC ATT C AAAT ATGTATCCGC TCATGAGACA ATAACCCTGA TAAATGCTTC AATAATATTG 42 0 

AAAAAGGAAG AGTATGAGTA TTCAACATTT CCGTGTCGCC CTTATTCCCT TTTTTGCGGC 480 

ATTTTGCCTT CCTGTTTTTG CTCACCCAGA AACGCTGGTG AAAGTAAAAG ATGCTGAAGA 54 0 

TCAGTTGGGT GCACGAGTGG GTTACATCGA ACTGGATCTC AACAGCGGTA AGATCCTTGA 6 00 

GAGTTTTCGC CCCGAAGAAC GTTTTCCAAT GATGAGCACT TTTAAAGTTC TGCTATGTGG 66 0 

CGCGGTATTA TCCCGTGTTG ACGCCGGGCA AGAGCAACTC GGTCGCCGCA TACACTATTC 720 

TCAGAATGAC TTGGTTGAGT ACTCACCAGT C AC AG AAA AG CATCTTACGG ATGGCATGAC 78 0 
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AGTAAGAGAA TTATGCAGTG CTGCCATAAC 
TCTGACAACG ATCGGAGGAC CGAAGGAGCT 
TGTAACTCGC CTTGATCGTT GGGAACCGGA 
TGACACCACG ATGCCTTCAG CAATGGCAAC 
5 ACTTACTCTA GCTTCCCGGC AACAATTAAT 

ACCACTTCTG CGCTCGGCCC TTCCGGCTGG 
TGAGCGTGGG TCTCGCGGTA TCATTGCAGC 
CGTAGTTATC TACACGACGG GGAGTCAGGC 
TGAGATAGGT GCCTCACTGA TTAAGCATTG 
10 ACTTTAGATT GATTTAAAAC TTCATTTTTA 

TGATAATCTC ATGACCAAAA TCCCTTAACG 
CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA 
GCAAACAAAA AAACCACCGC TACCAGCGGT 
TCTTTTTCCG AAGGTAACTG GCTTCAGCAG 
15 GTAGCCGTAG TTAGGCCACC ACTTCAAGAA 

GCTAATCCTG TTACCAGTGG CTGCTGCCAG 
CTCAAGACGA TAGTTACCGG ATAAGGCGCA 
ACAGCCCAGC TTGGAGCGAA CGACCTACAC 
AGAAAGCGCC ACGCTTCCCG AAGGGAGAAA 
20 CGGAACAGGA GAGCGCACGA GGGAGCTTCC 

TGTCGGGTTT CGCCACCTCT GACTTGAGCG 
GAGCCTATGG AAAAACGCCA GCAACGCGGC 
TGGCGGACGC GATGGATATG TTCTGCCAAG 
ftGAATTGATT GGCTCCAATT CTTGGAGTGG 
25 CATTCAGGTC GAGGTGGCCC GGCTCCATGC 

GTATAGGGCG GCGCCTACAA TCCATGCCAA 
AATCGCCGTG ACGATCAGCG GTCCAGTGAT 
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CATGAGTGAT AACACTGCGG CCAACTTACT 84 0 

AACCGCTTTT TTGCACAACA TGGGGGATCA 90 0 

GCTGAATGAA GCCATACCAA ACGACGAGCG 96 0 

AACGTTGCGC AAACTATTAA CTGGCGAACT 102 0 

AGACTGGATG GAGGCGGATA AAGTTGCAGG 108 0 

CTGGTTTATT GCTGATAAAT CTGGAGCCGG 114 0 

ACTGGGGCCA GATGGTAAGC CCTCCCGTAT 1200 

AACTATGGAT GAACGAAATA GACAGATCGC 12 6 0 

GTAACTGTCA GACCAAGTTT ACTCATATAT 132 0 

ATTTAAAAGG ATCTAGGTGA AGATCCTTTT 13 8 0 

TGAGTTTTCG TTCCACTGAG CGTCAGACCC 144 0 

TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT 1500 

GGTTTGTTTG CCGGATCAAG AGCTACCAAC 156 0 

AGCGCAGATA CCAAATACTG TCCTTCTAGT 1620 

CTCTGTAGCA CCGCCTACAT ACCTCGCTCT 168 0 

TGGCGATAAG TCGTGTCTTA CCGGGTTGG A 174 0 

GCGGTCGGGC TGAACGGGGG GTTCGTGCAC 1800 

CGAACTGAGA TACCTACAGC GTGAGCATTG 186 0 

GGCGGACAGG TATCCGGTAA GCGGCAGGGT 192 0 

AGGGGGAAAC GCCTGGTATC TTTATAGTCC 198 0 

TCGATTTTTG TGATGCTCGT CAGGGGGGCG 2 04 0 

CCGAGATGCG CCGCGTGCGG CTGCTGGAGA 2100 

GGTTGGTTTG CGCATTCACA GTTCTCCGCA 2 16 0 

TGAATCCGTT AGCGAGGTGC CGCCGGCTTC 2220 

ACCGCGACGC AACGCGGGGA GGCAGACAAG 228 0 

CCCGTTCCAT GTGCTCGCCG AGGCGGCATA 234 0 

CGAAGTTAGG CTGGTAAGAG CCGCGAGCGA 24 0 0 



- 24 . 



BNSDOCID: <WO 961 0641 A 1 J_> 



WO 96/10641 




TCCTTGAAGC TGTCCCTGAT GGTCGTCATC 
GGGCATCCCG ATGCCGCCGG AAGCGAGAAG 
CGTCGCGAAC GCCAGCAAGA CGTAGCCCAG 
TTACATTAAT TGCGTTGCGC TCACTGCCCG 
5 TGCATTAATG AATCGGCCAA CGCGCGGGGA 

GTTTTTCTTT TCACCAGTGA GACGGGCAAC 
GAGAGTTGCA GCAAGCGGTC CACGCTGGTT 
GTGGTTGACG GCGGGATATA ACATGAGCTG 
ATATCCGCAC CAACGCGCAG CCCGGACTCG 
10 TGATCGTTGG CAACCAGCAT CGCAGTGGGA 

TGTTGAAAAC CGGACATGGC ACTCCAGTCG 
TTGCGAGTGA GATATTTATG CCAGCCAGCC 
GGGCCCCCGG TGACAGGGAC AG AGAGGGCT 
GATCCAAAGC TCCAGGGCGA GCTCGAATTC 
15 CTTACACATC CCAGCCCTGA AAAAGGGCAT 

TTACGTTGAC ACCATCGAAT GGTGCAAAAC 
AGAGAGTCAA TTCAGGGTGG TGAATGTGAA 
TGCCGGTGTC TCTTATCAGA CCGTTTCCCG 
GAAAACGCGG GAAAAAGTGG AAGCGGCGAT 
20 GGCACAACAA CTGGCGGGCA AACAGTCGTT 

CCTGCACGCG CCGTCGCAAA TTGTCGCGGC 
CAGCGTGGTG GTGTCGATGG TAGAACGAAG 
CAATCTTCTC GCGCAACGCG TTCAGTGGGC 
CGCTGGATGA CCAGGATGCC ATTGCTGTGG 
25 TTCTTGATGT CTCTG ACCAG ACACCCATCA 

CGCGACTGGG CGTGGAGCAT CTGGTCGCAT 
GCCCTGTACG TCTGAGGCCG AGGGAAAGCT 




PCT/EP95/03663 



TACCTGCCTG GACAGCATGG CCTGCAACGC 24 6 0 

AATCATAATG GGGAAGGCCA TCCAGCCTCG 2 52 0 

CGCGTCGGCC AGCTTGCAAT TCGCGCTAAC 2 53 0 

CTTTCCAGTC GGGAAACCTG TCGTGCCAGC 2 64 0 

GAGGCGGTTT GCGTATTGGG CGCCAGGGTG 27 0 0 

AGCTGATTGC CCTTCACCGC CTGGCCCTGA 2 76 0 

TGCCCCAGCA GGCGAAAATC CTGTTTGATG 2S2 0 

TCTTCGGTAT CGTCGT AT CC CACTACCGAG 2SS0 

GTAATGGCGC GCATTGCGCC CAGCGCCATC 2 94 0 

ACGATGCCCT CATTCAGCAT TTGCATGGTT 3 00 0 

CCTTCCCGTT CCGCTATCGG CTGAATTTGA 3 06 0 

AGACGCAGAC GCGCCGAGAC AGAACTTAAT 3 12 0 

TCTGGAGGAA AAA AG AAA A A AAAAAAAAAA 3 13 0 

CCCGGTAAAG CCGCTTAAGA CATTCCCGCT 324 0 

CAAAATAAAC CAC ACCTATG GTGT ATGCAT 3 3 00 

CTTTCGCGGT ATGGCATGAT AGCGCCCGGA 33 6 0 

ACCAGTAACG TT ATACGATG TCGCAGAGTA 3 4 2 0 

CGTGGTGAAC CAGGCCAGCC ACGTTTCTGC 34 SO 

GGCGGAGCTG AATTACATTC CCAACCGCGT 3 54 0 

GCTGATTGGC GTTGCCACCT CCAGTCTGGC 36 0 0 

GATTAAATCT CGCGCCGATC AACTGGGTGC 36 6 0 

CGGCGTCGAA GCCTGTAAAG CGGCGGTGCA 3720 

TAGATCTACC GCGGTAGATC ATTAACTATC 37 9 0 

AAGCTGCCTG CACTAATGTT CCGGCGTTAT 3 54 0 

ACAGTATTAT TTTCTCCCAT GAAGACGGTA 3 9 00 

TGGGTCACCA GCAAATCGCG CTGTTAGCGG 3 960 

ATGGGCGCGG TTTTCTTTCA TTGACCTGTC 4 020 
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GGTCTTATCA GTTCTCCGGG TTGTCAGGTC GACCAGTTGT TCCTTTGAGG TCCGGTTCTT 4 08 0 

TTCGTTATGG GGTCATTTTT GGGCCACCTC CCCAGGTATG ACTTCCAGGT ATTCTCTGTG 414 0 

GCCTGTCACT TTCCTCCCTG TCTCTTTTAT GCTTGTGATC TTTTAGATCT GGTCCTATTG 42 00 

GACCTGGAGA TAGGTAGTAG AAACAAGGGT GTTTTTAAAT ACTAGTACAT TACGCCCCGC 42 6 0 

CCTGCCACTC ATCGCAGTAC TGTTGTAATT CATTAAGCAT TCTGCCGACA TGGAAGCCAT 4 32 0 

CACAGACGGC ATGATGAACC TGAATCGCCA GCGGCATCAG CACCTTGTCG CCTTGCGTAT 4 38 0 

AATATTTGCC CATGGTGAAA ACGGGGGCGA AGAAGTTGTC CATATTGGCC ACGTTTAAAT 44 4 0 

CAAAACTGGT GAAACTCACC CAGGGATTGG CTGAGACGAA AAACATATTC TCAATAAACC 4 50 0 

CTTTAGGGAA ATAGGCCAGG TTTTCACCGT AACACGCCAC ATCTTGCGAA TATATGTGTA 4 56 0 

GAAACTGCCG GAAATCGTCG TGGTATTCAC TCCAGAGCGA TGAAAACGTT TCAGTTTGCT 462 0 

CATGG AAAAC GGTGTAACAA GGGTGAACAC TATCCCATAT CACCAGCTCA CCGTCTTTCA 4680 

TTGCCATACG GAATTCCGGA TGAGCATTCA TCAGGCGGGC AAGAATGTGA ATAAAGGCCG 4 74 0 

GATAAAACTT GTGCTTATTT TTCTTTACGG TCTTTAAAAA GGCCGTAATA TCCAGCTGAA 4 8 00 

CGGTCTGGTT ATAGGTACAT TGAGCAACTG ACTGAAATGC CTCAAAATGT TCTTTACGAT 4 86 0 

15 GCCATTGGGA TATATCAACG GTGGTATATC CAGTGATTTT TTTCTCCATG ATTAATAGAA 4 92 0 

TTATCCCCTG TTTCTACTCC CCCCCAACTT CGGAGGTCGA CCAGTACTCC GGGCGAAACT 4 98 0 

TTGTTTTTTT TTTTTCCCCC GATGCTGGAG GTCGACCAGA TGTCCGAAAG TGTCCCCCCC 504 0 

CCCCCCCCCC CCCGGCGCGG AACGGCGGGG CCACTCTGGA CTCTTTTTTT TTTTTTTTTT 510 0 

TTTTTTTTTG GGGATCCTCT AGAGTCGACC TGCAGCCCAA GCTAGCGGCC GCTAGCTTCT 516 0 

GTTTTGGCGG ATGAGAGAAG ATTTTCAGCC TGATACAGAT TAAATCAGAA CGCAGAAGCG 52 2 0 

GTCTGATAAA ACAGAATTTG C 5241 
(2) INFORMATION FOR SEQ ID NO: 3: 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

<ii) MOLECULE TYPE: RNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Influenza virus, 3' RNA sequence 

(C) INDIVIDUAL ISOLATE: Wild Type vRNA Promotor Element 



-> (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

UCGUUUUCGU CCC 13 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 
10 (B) TYPE: nucleic acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : RNA (genomic) 

(iii) HYPOTHETICAL: NO 

15 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Influenza virus, vRNA 5' sequence 

(C) INDIVIDUAL ISOLATE: pHL1104 vRNA Promoter Element 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
20 UCAUCUUUGU CCC 13 

(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: RNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Influenza virus, cRNA 3' sequence 
(C) INDIVIDUAL ISOLATE: cRNA Promoter element 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



AGUAGAAACA AGGG 



14 



10 



15 



20 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Influenza virus, RNA sequence 
(C) INDIVIDUAL ISOLATE: pHL1191 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CTAGACTCTT CAAGCAAAAG CAGGTAGATC TTGAAAGATG AGTCTTCTAA CCGAGGTCGA 6 0 

AACGTACGTT CTCTCTATCA TCCCGTCAGG CCCCCTCAAA GCCGAGATCG CACAGAGACT 12 0 

TGAAGATGTC TTTGCAGGGA AGAACACCGA TCTTGAGGTT CTCATGGAAT GGCTAAAGAC 18 0 

AAGACCAATC CTGTCACCTC TGACTAAGGG GATTTTAGGA TTTGTGTTCA CGCTCACCGT 24 0 

GCCCAGTGAG CGAGGACTGC AGCGTAGACG CTTTGTCCAA AATGCCCTTA ATGGGAACGG 3 00 

GGATCCAAAT AACATGGACA AAGCAGTTAA ACTGTATAGG AAGCTCAAGA GGGAGATAAC 36 0 

ATTCCATGGG GCCAAAGAAA TCTCACTCAG TTATTCTGCT GGTGCACTTG CCAGTTGTAT 42 0 

GGGCCTCATA TACAACAGGA TGGGGGCTGT GACCACTGAA GTGGCATTTG GCCTGGTATG 46 0 

TGCAACCTGT GAACAGATTG CTGACTCCCA GCATCGGTCT CATAGGCAAA TGGTGACAAC 54 0 

AACCAACCCA CTAATCAGAC ATGAGAACAG AATGGTTTTA GCCAGCACTA CAGCTAAGGC 6 00 

TATGGAGCAA ATGGCTGGAT CGAGTGAGCA AGCAGCAGAG GCCATGGAGG TTGCTAGTCA 66 0 

GGCTAGGCAA ATGGTGCAAG CGATGAGAAC CATTGGGACT CATCCTAGCT CCAGTGCTGG 72 0 

TCTGAAAAAT GCTCTTCTTG AAAATTTGCA GGCCTATCAG AAACGAATGG GGGTGCAGAT 78 0 

GCAACGGTTC AAGTGATCCT CTCGCTATTG CCGCAAATAT CATTGGGATC TTGCACTTGA 84 0 
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TATTGTGGAT 


TCTTGATCGT 


CTTTTTTTCA 


AATGCATTTA 


v-p \— \j A V- \_J x X -L 




9 00 


TGAAAGGAGG 


GCCTTCTACG 


GAAGGAGTGC 


CAAAGTCTAT 


GAGGGAAGAA 




9 6 0 


AACAGCAGAG 


TGCTGTGGAT 


GCTGACGATG 


GTCATTTTGT 


CAGCATAGAG 


CTGG AflT A A A 


10 2 0 


AAACTACCAT 


ATGGGGCATG 


TCCCAAGTAT 


GTTAAGCAAA 


ACACTCTGAA 




10 8 0 


GGGATGCGGA 


ATGTACCAGA 


GAAACAAACT 


AGAGGCATAT 


TCGGCGCAAT 


ft^Unijoi i. I \_ 


114 0 


ATAGAAAATG 


GTTGGGAGGG 


AATGATAGAC 


GGTTGGTACG 


GTTTCAGGCA 


Tr*AAAATTr*r* 


12 0 0 


GAGGGCACAG 


GACAAGCAGC 


AGATCTTAAA 


AGCACTCAAG 


CAGCCATCGA 




12 6 0 


GGGAAACTGA 


ATAGGGTAAT 


CG AG AAGACG 


AACGAGAAAT 


TCCATCAAAT 


CGAAAAnriAA 


13 2 0 


TTCTCAGAAG 


TAGAAGGGAG 


AATTCAGGAC 


CTCGAGCATG 


CATCTAGAGG 


GCrrTATTfT 

*J *w \— V_ 1 rt 1 1 L 1 




ATAGTGTCAC 


CTAAATGCTA 


GAGCTCGCTG 


ATCAGCCTCG 


ACTGTGCCTT 




14 4 0 


GCCATCTGTT 


GTTTGCCCCT 


CCCCCGTGCC 


TTCCTTGACC 




u L AL I LCCAC 


150 0 


TGTCCTTTCC 


TAATAAAATG 


AGGAAATTGC 


ATCGCATTGT 


CTGAGTAGGT 

*>— a i, *"v\j\j 1 


o 1 Lrt i 1L1A1 


15 6 0 


TCTGGGGGGT 


GGGGTGGGGC 


AGGACAGCAA 


GGGGGAGGAT 


TGGGAAGACA 




16 2 0 


TGCTGGGGAT 


GCGGTGGGCT 


CTATGGCTTC 


TGAGGCGGAA 


AGAACCAGCT 




16 8 0 


GGGGTATCCC 


CACGCGCCCT 


GTAGCGGCGC 


ATTAAGCGCG 




T 1 r^"T tr T' A r~* r~< 
1 bbl 1Av-CjV_G 


174 0 


CAGCGTGACC 


GCTACACTTG 


CCAGCGCCCT 


AGCGCCCGCT 


CCTTTfnrTT 


Tr'TTfrr'TTr' 
1 \~ 1 1LLLI 1 L 


18 0 0 


CTTTCTCGCC 


ACGTTCGCCG 


GCTTTCCCCG 


TCAAGCTCTA 


AATcczczccr' a 


ItLLl iTAGG 


18 6 0 


GTTCCGATTT 


AGTGCTTTAC 


GGCACCTCGA 


CCCCAAAAAA 


TTTriATTZirr 

^— i. 1 \j rt 1 1 nou 


\j I <jA I GGTTC 


19 2 0 


ACGTAGTGGG 


CCATCGCCCT 


GATAGACGGT 


TTTTCGCCCT 




A\j 1 LLACGTT 


19 8 0 


CTTTAATAGT 


GGACTCTTGT 


TCCAAACTGG 


AACAACACTC 




f f"* Z -1 T* ^ T"T* * T*Ti 

1 L i TATT 


2 04 0 


CTTTTGATTT 


ATAAGGGATT 


TGGGGATTTC 


GGCCTATTGG 


TT A A A A A ITr 


MbLlbAI 1 TA 


210 0 


ACAAAAATTT 


AACGCGAATT 


AATTCTGTGG 


AATGTGTGTC 


ACTTAf^nrzTr: 

/A\J 1 1 rtOVJO A VJ 


1 GbAAAGTCC 


2 16 0 


CCAGGCTCCC 


CAGGCAGGCA 


GAAGTATGCA 


AAGCATGCAT 


r'TPAATTACT 


t_ ACjC_ AACCAG 


2 2 2 0 


GTGTGGAAAG 






CAGAAGTATG 


CAAAGCATGC 


ATCTCAATTA 


2280 


GTCAGCAACC 


ATAGTCCCGC 


CCCTAACTCC 


GCCCATCCCG 


CCCCTAACTC 


CGCCCAGTTC 


234 0 


CGCCCATTCT 


CCGCCCCATG 


GCTGACTAAT 


TTTTTTTATT 


TATGCAGAGG 


CCGAGGCCGC 


2400 


CTCTGCCTCT 


GAGCTATTCC 


AGAAGTAGTG 


AGGAGGCTTT 


TTTGGAGGCC 


TAGGCTTTTG 


24 6 0 
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CAAAAAGCTC CCGGGAGCTT GTATATCCAT TTTCGGATCT GATCAAGAGA CAGGATGAGG 2 52 0 

ATCGTTTCGC ATGATTGAAC AAGATGGATT GCACGCAGGT TCTCCGGCCG CTTGGGTGGA 25 8 0 

GAGGCTATTC GGCTATGACT GGGCACAACA GACAATCGGC TGCTCTGATG CCGCCGTGTT 264 0 

CCGGCTGTCA GCGCAGGGGC GCCCGGTTCT TTTTGTCAAG ACCGACCTGT CCGGTGCCCT 27 00 

5 GAATGAACTG CAGGACGAGG CAGCGCGGCT ATCGTGGCTG GCCACGACGG GCGTTCCTTG 276 0 

CGCAGCTGTG CTCGACGTTG TCACTGAAGC GGGAAGGGAC TGGCTGCTAT TGGGCGAAGT 28 2 0 

GCCGGGGCAG GATCTCCTGT CATCTCACCT TGCTCCTGCC GAGAAAGTAT CCATCATGGC 288 0 

TGATGCAATG CGGCGGCTGC ATACGCTTGA TCCGGCTACC TGCCCATTCG ACCACCAAGC 2 94 0 

GAAACATCGC ATCGAGCGAG CACGTACTCG GATGGAAGCC GGTCTTGTCG ATCAGGATGA 3000 

10 TCTGGACGAA GAGCATCAGG GGCTCGCGCC AGCCGAACTG TTCGCCAGGC TCAAGGCGCG 3 06 0 

CATGCCCGAC GGCGAGGATC TCGTCGTGAC CCATGGCGAT GCCTGCTTGC CGAATATCAT 3120 

GGTGGAAAAT GGCCGCTTTT CTGGATTCAT CGACTGTGGC CGGCTGGGTG TGGCGGACCG 318 0 

CTATCAGGAC ATAGCGTTGG CTACCCGTGA TATTGCTGAA GAGCTTGGCG GCGAATGGGC 3 24 0 

TGACCGCTTC CTCGTGCTTT ACGGTATCGC CGCTCCCGAT TCGCAGCGCA TCGCCTTCTA 3 300 

15 TCGCCTTCTT GACGAGTTCT TCTGAGCGGG ACTCTGGGGT TCGAAATGAC CGACCAAGCG 336 0 

ACGCCCAACC TGCCATCACG AGATTTCGAT TCCACCGCCG CCTTCTATGA AAGGTTGGGC 342 0 

TTCGGAATCG TTTTCCGGGA CGCCGGCTGG ATGATCCTCC AGCGCGGGGA TCTCATGCTG 34 8 0 

GAGTTCTTCG CCCACCCCAA CTTGTTTATT GCAGCTTATA ATGGTTACAA ATAAAGCAAT 354 0 

AGCATCACAA ATTTCACAAA TAAAGCATTT TTTTCACTGC ATTCTAGTTG TGGTTTGTCC 36 00 

20 AAACTCATCA ATGTATCTTA TCATGTCTGT ATACCGTCGA CCTCTAGCTA GAGCTTGGCG 3 66 0 

TAATCATGGT CATAGCTGTT TCCTGTGTGA AATTGTTATC CGCTCACAAT TCCACACAAC 3 72 0 

ATACG AGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT AATGAGTGAG CTAACTCACA 3 7 80 

TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA ACCTGTCGTG CCAGCTGCAT 3840 

TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC TTCCGCTTCC 3 9 00 

25 TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC GAGCGGTATC AGCTCACTCA 3960 

AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG CAGGAAAGAA CATGTGAGCA 4 02 0 

AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG 4 08 0 
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CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG 
ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT 
CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT 
TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC 
TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT 
GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT 
AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAA CTACGGC 
TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA 
AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG GTAGCGGTGG TTTTTTTGTT 
TGCAAGCAGC AGATTACGCG C AG AAA AAA A GGATCTCAAG AAGATCCTTT GATCTTTTCT 
ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA 
TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA 
AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA GGCACCTATC 
TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT GTAGATAACT 
ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA TGATACCGCG AGACCCACGC 
TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT 
GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA 
AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG 
TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT CCCAACGATC AAGGCGAGTT 
ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC 
AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG CAGCACTGCA TAATTCTCTT 
ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC 
TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCAATACG GGATAATACC 
GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA AACGTTCTTC GGGGCGAAAA 
CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC 
TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA 
AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT 
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4 14 0 

4200 

4260 

4320 

4380 

444 0 

4 500 

4560 

4620 

4680 

4740 

4800 
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4920 

4 980 

504 0 

5100 

516 0 

5220 

5280 

534 0 

54 00 

5460 

5520 

5580 

564 0 

5700 
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TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA 57 6 0 

TGTATTTAGA AAA AT AAA C A AATAGGGGTT CCGCGCACAT TTCCCCGAAA AGTGCCACCT 582 0 

GACGTCGACG GATCGGGAGA TCTCCCGATC CCCTATGGTC GACTCTCAGT ACAATCTGCT 588 0 

CTGATGCCGC ATAGTTAAGC CAGTATCTGC TCCCTGCTTG TGTGTTGGAG GTCGCTGAGT 5 94 0 

5 AGTGCGCGAG CAAAATTTAA GCTACAACAA GGCAAGGCTT GACCGACAAT TGCATGAAGA 6 000 

ATCTGCTTAG GGTTAGGCGT TTTGCGCTGC TTCGCGATGT ACGGGCCAGA TATACGCGTT 6 06 0 

GACATTGATT ATTGACTAGT TATTAATAGT AATCAATTAC GGGGTCATTA GTTCATAGCC 6 12 0 

CATATATGGA GTTCCGCGTT ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA 6180 

ACGACCCCCG CCCATTGACG TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA 6 24 0 

10 CTTTCCATTG ACGTCAATGG GTGGACTATT TACGGTAAAC TGCCCACTTG GCAGTACATC 6 3 00 

AAGTGTATCA TATGCCAAGT ACGCCCCCTA TTGACGTCAA TGACGGTAAA TGGCCCGCCT 636 0 

GGCATT ATGC CCAGTACATG ACCTTATGGG ACTTTCCTAC TTGGCAGTAC ATCTACGTAT 64 2 0 

TAGTCATCGC TATTACCATG GTGATGCGGT TTTGGCAGTA CATC AATGGG CGTGG ATAGC 64 8 0 

GGTTTGACTC ACGGGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT 6 54 0 

15 GGCACC AAAA TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA TTGACGCAAA 66 0 0 

TGGGCGGTAG GCGTGTACGG TGGGAGGTCT ATATAAGCAG AGCTCTCTGG CTAACTAGAG 6 66 0 

AACCCACTGC TTACTGGCTT ATCGAAATTA ATACGACTCA CTATAGGGAG ACCCAAGCTT 6 720 

GGTACCGAGC TCGGATCCAC TAGTAACGGC CGCCAGTGTG CTGGAATTCT GCAGATTCTT 678 0 

CTCTCATCCG CCAAAACAGA AG 6 6 02 
20 (2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5825 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 
25 (D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Influenza virus, RNA sequence 
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(C) INDIVIDUAL ISOLATE: pHL1489 



(xi) SEQUENCE DESCRIPTION: SE 
CGATGGTCAT TTTGTCCTGA TGAGTCCGTG 
ATGAGTCCGT GAGGACGAAA CTACCTTGTT 
5 CAACCTGCCA TCACGAGATT TCGATTCCAC 

AATCGTTTTC CGGGACGCCG GCTGGATGAT 
CTTCGCCCAC CCCGGGCTCG ATCCCCTCGC 
CGACCTCGCG GAGTTCTACC GGCAGTGCAA 
CTATCCGCGC ATCCATGCCC CCGAACTGCA 

10 CCCGGATCTT TGTGAAGGAA CCTTACTTCT 

CAGAGATTTA AAGCTCTAAG GTAAATATAA 
TGATTCTAAT TGTTTGTGTA TTTTAGATTC 
GGTGGAATGC CTTTAATGAG GAAAACCTGT 
ATGAGGCTAC TGCTGACTCT CAACATTCTA 

15 ACCCCAAGGA CTTTCCTTCA GAATTGCTAA 

GAACTCTTGC TTGCTTTGCT ATTTACACCA 
AAATTATGGA AAAATATTCT GTAACCTTTA 
TACTGTTTTT TCTTACTCCA CACAGGCATA 
AATTGTGTAC CTTTAGCTTT TTAATTTGTA 

20 GTGCCTTGAC TAGAGATCAT AATCAGCCAT 

AAAAACCTCC CACACCTCCC CCTGAACCTG 
AACTTGTTTA TTGCAGCTT A TAATGGTTAC 
AATAAAGCAT TTTTTTCACT GCATTCTAGT 
TATCATGTCT GGATCCCCAG GAAGCTCCTC 

25 TTGAGAGGAC ATTCCAATCA TAGGCTGCCC 

GTCACTTAAC AAAAAGGAAA TTGGGTAGGG 



!Q ID NO : 7 : 

AGGACGAAAC ATAGAGCTGG AGTAAAACTG 6 0 

TCTATTCGAA ATGACCGACC AAGCGACGCC 12 0 

CGCCGCCTTC TATGAAAGGT TGGGCTTCGG 180 

CCTCCAGCGC GGGGATCTCA TGCTGGAGTT 24 0 

GAGTTGGTTC AGCTGCTGCC TGAGGCTGGA 3 00 

ATCCGTCGGC ATCCAGGAAA CCAGCAGCGG 36 0 

GGAGTGGGGA GGCACGATGG CCGCTTTGGT 4 20 

GTGGTGTGAC ATAATTGGAC AAACTACCTA 4 80 

AATTTTTAAG TGTATAATGT GTTAAACTAC 54 0 

CAACCTATGG AACTGATGAA TGGGAGCAGT 6 00 

TTTGCTCAGA AGAAATGCCA TCTAGTGATG 6 60 

CTCCTCCAAA AAAGAAGAG A AAGGTAGAAG 72 0 

GTTTTTTGAG TCATGCTGTG TTTAGTAATA 78 0 

CAAAGGAAAA AGCTGCACTG CTATACAAGA 84 0 
TAAGTAGGCA TAACAGTTAT AATCATAACA 9 00 

GAGTGTCTGC TATTAATAAC TATGCTCAAA 96 0 

AAGGGGTTAA TAAGGAATAT TTGATGTATA 102 0 

ACCACATTTG TAGAGGTTTT ACTTGCTTTA 108 0 

AAA CAT AAA A TGAATGCAAT TGTTGTTGTT 114 0 

AAATAAAGCA ATAGCATCAC AAATTTCACA 12 00 

TGTGGTTTGT CCAAACTCAT CAATGTATCT 12 6 0 

TGTGTCCTCA TAAACCCTAA CCTCCTCTAC 132 0 

ATCCACCCTC TGTGTCCTCC TGTTAATTAG 138 0 

GTTTTTCACA GACCGCTTTC TAAGGGTAAT 144 0 
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TTTAAAATAT CTGGGAAGTC CCTTCCACTG 
ACAAATGTCA AC AG C AG AAA CATACAAGCT 
CTCATCAAGA AGCACTGTGG TTGCTGTGTT 
CCCACCTGTG TAGGTTCCAA AATATCTAGT 
5 GCACTCCACT GGATAAGCAT TATCCTTATC 

CTGACTGTCA ACTGTAGCAT TTTTTGGGGT 
GTTTGCTAAC ACACCCTGCA GCTCCAAAGG 
TGACCCTTGA ATGGGTTTTC CAGCACCATT 
GTTTAACATA GCAGTTACCC CAATAACCTC 
10 AATATTTCCA CAGGTTAAGT CCTCATTTAA 

GGCCTCGTGA TACGCCTATT TTTATAGGTT 
TCAGGTGGCA CTTTTCGGGG AAATGTGCGC 
CATTCAAATA TGTATCCGCT CATG AGACAA 
AAAAGGAAGA GTATGAGTAT TCAACATTTC 
15 TTTTGCCTTC CTGTTTTTGC TCACCCAGAA 

CAGTTGGGTG CACGAGTGGG TTACATCGAA 
AGTTTTCGCC CCGAAGAACG TTTTCCAATG 
GCGGTATTAT CCCGTGTTGA CGCCGGGCAA 
CAGAATGACT TGGTTGAGTA CTCACCAGTC 
20 GTAAGAGAAT TATGCAGTGC TGCCATAACC 

CTGACAACGA TCGGAGGACC GAAGGAGCTA 
GTAACTCGCC TTGATCGTTG GGAACCGGAG 
GACACCACGA TGCCTGCAGC AATGGCAACA 
CTTACTCTAG CTTCCCGGCA ACAATTAATA 
25 CCACTTCTGC GCTCGGCCCT TCCGGCTGGC 

GAGCGTGGGT CTCGCGGTAT CATTGCAGCA 
GTAGTTATCT ACACG ACGGG GAGTCAGGCA 
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CTGTGTTCCA GAAGTGTTGG TAAACAGCCC 150 0 

GTCAGCTTTG CACAAGGGCC CAACACCCTG 156 0 

AGTAATGTGC AAAACAGGAG GCACATTTTC 162 0 

GTTTTCATTT TTACTTGGAT CAGGAACCCA 16 8 0 

CAAAACAGCC TTGTGGTCAG TGTTCATCTG 174 0 

TACAGTTTGA GCAGGATATT TGGTCCTGTA 18 0 0 

TTCCCCACCA ACAGCAAAAA AATGAAAATT 186 0 

TTCATGAGTT TTTTGTGTCC CTGAATGCAA 192 0 

AGTTTTAACA GTAACAGCTT CCCACATCAA 198 0 

ATT AGG C AAA GGAATTCTTG AAGACGAAAG 2 04 0 

AATGTCATGA TAATAATGGT TTCTTAGACG 2100 

GGAACCCCTA TTTGTTTATT TTTCTAAATA 216C 

TAACCCTGAT AAATGCTTCA ATAATATTGA 222 C 

CGTGTCGCCC TTATTCCCTT TTTTGCGGCA 22 8 C 

ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT 2 34 C 

CTGGATCTCA ACAGCGGTAA GATCCTTGAG 24 OC 

ATGAGCACTT TTAAAGTTCT GCTATGTGGC 24 6 C 

GAGCAACTCG GTCGCCGCAT ACACTATTCT 2 52 C 

ACAGAAAAGC ATCTTACGGA TGGCATGACA 258 C 

ATGAGTGATA ACACTGCGGC CAACTTACTT 264 C 

ACCGCTTTTT TGCACAACAT GGGGGATCAT 27 OC 

CTGAATGAAG CCATACCAAA CGACGAGCGT 2 76 C 

ACGTTGCGCA AACTATTAAC TGGCGAACTA 282 0 

GACTGGATGG AGGCGGATAA AGTTGCAGGA 28 8 C 

TGGTTTATTG CTGATAAATC TGGAGCCGGT 2 94 C 

CTGGGGCCAG ATGGTAAGCC CTCCCGTATC 30 0 C 

ACT ATGGATG AACGAAATAG ACAGATCGCT 3 06 G 
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GAGATAGGTG 
CTTTAGATTG 
GATAATCTCA 
GTAGAAAAGA 
5 CAAACAAAAA 
CTTTTTCCGA 
TAGCCGTAGT 
CTAATCCTGT 
TCAAGACGAT 

10 CAGCCCAGCT 
GAAAGCGCCA 
GGAACAGGAG 
GTCGGGTTTC 
AGCCTATGGA 

15 TTTGCTCACA 
TTTGAGTGAG 
GAGGAAGCGG 
CACCGCATAT 
ACACTCCGCT 

20 CTGACGCGCC 
TCTCCGGGAG 
TGTGGAATGT 
TGCAAAGCAT 
CAGGCAGAAG 

25 CTCCGCCCAT 
TAATTTTTTT 
AGTGAGGAGG 



CCTCACTGAT TAAGCATTGG 
ATTTAAAACT TCATTTTTAA 
TGACCAAAAT CCCTTAACGT 
TCAAAGGATC TTCTTGAGAT 
AACCACCGCT ACCAGCGGTG 
AGGTAACTGG CTTCAGCAGA 
TAGGCCACCA CTTCAAGAAC 
TACCAGTGGC TGCTGCCAGT 
AGTTACCGGA TAAGGCGCAG 
TGGAGCGAAC GACCTACACC 
CGCTTCCCGA AG GG AG AAA G 
AGCGCACGAG GGAGCTTCCA 
GCCACCTCTG ACTTGAGCGT 
AAAACGCCAG CAACGCGGCC 
TGTTCTTTCC TGCGTTATCC 
CTGATACCGC TCGCCGCAGC 
AAGAGCGCCT GATGCGGTAT 
GGTGCACTCT CAGTACAATC 
ATCGCTACGT GACTGGGTCA 
CTGACGGGCT TGTCTGCTCC 
CTGCATGTGT CAGAGGTTTT 
GTGTCAGTTA GGGTGTGGAA 
GCATCTCAAT TAGTCAGCAA 
TATGCAAAGC ATGCATCTCA 
CCCGCCCCTA ACTCCGCCCA 
TATTTATGCA GAGGCCGAGG 
CTTTTTTGGA GGCCTAGGCT 



TAACTGTCAG ACCAAGTTTA 

TTTAAAAGGA TCTAGGTGAA 

GAGTTTTCGT TCCACTGAGC 

CCTTTTTTTC TGCGCGTAAT 

GTTTGTTTGC CGGATCAAGA 

GCGCAGATAC CAAATACTGT 

TCTGTAGCAC CGCCTACATA 

GGCGATAAGT CGTGTCTTAC 

CGGTCGGGCT GAACGGGGGG 

GAACTGAGAT ACCTACAGCG 

GCGGACAGGT ATCCGGTAAG 

GGGGGAAACG CCTGGTATCT 

CGATTTTTGT GATGCTCGTC 

TTTTTACGGT TCCTGGCCTT 

CCTGATTCTG TGGATAACCG 

CGAACGACCG AGCGCAGCGA 

TTTCTCCTTA CGC ATCTGTG 

TGCTCTGATG CCGCATAGTT 

TGGCTGCGCC CCGACACCCG 

CGGCATCCGC TTACAGACAA 

CACCGTCATC ACCGAAACGC 

AGTCCCCAGG CTCCCCAGCA 

CCAGGTGTGG AAAGTCCCCA 

ATTAGTCAGC AACCATAGTC 

GTTCCGCCCA TTCTCCGCCC 

CCGCCTCGGC CTCTGAGCTA 

TTTGCAAAAA GCTTCACGCT 



CTCATATATA 3120 

G AT C CTTTTT 318 0 

GTCAGACCCC 3 24 0 

CTGCTGCTTG 3 3 00 

GCTACCAACT 33 6 0 

CCTTCTAGTG 34 2 0 

CCTCGCTCTG 34 8 0 

CGGGTTGGAC 3 54 0 

TTCGTGCACA 3600 

TGAGCTATGA 366 0 

CGGCAGGGTC 3720 

TTATAGTCCT 3 78 0 

AGGGGGGCGG 3 84 0 

TTGCTGGCCT 3 900 

TATTACCGCC 3 96 0 

GTCAGTGAGC 4 02 0 

CGGTATTTCA 4 08 0 

AAGCCAGTAT 4 14 0 

CCAACACCCG 4200 

GCTGTGACCG 4260 

GCGAGGCAGC 4 32 0 

GGCAGAAGTA 4 380 

GGCTCCCCAG 444 0 

CCGCCCCTAA 4 5 00 

CATGGCTGAC 4 56 0 

TTCCAGAAGT 4 62 0 

GCCGCAAGCA 4 68 0 
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CTCAGGGCGC AAGGGCTGCT AAAGGAAGCG GAACACGTAG AAAGCCAGTC CGCAGAAACG 4 74 0 

GTGCTGACCC CGGATGAATG TCAGCTACTG GGCTATCTGG ACAAGGGAAA ACGCAAGCGC * 4 8 00 

AAAGAGAAAG CAGGTAGCTT GCAGTGGGCT TACATGGCGA TAGCTAGACT GGGCGGTTTT 4 860 

ATGGACAGCA AGCGAACCGG AATTGCCAGC TGGGGCGCCC TCTGGTAAGG TTGGGAAGCC 4 92 0 

5 CTGCAAAGTA AACTGGATGG CTTTCTTGCC GCCAAGGATC TGATGGCGCA GGGGATCAAG 4 980 

ATCTGATCAA GAGACAGGAT GAGGATCGTT TCGCATGATT GAAC AAGATG GATTGCACGC 5 04 0 

AGGTTCTCCG GCCGCTTGGG TGGAGAGGCT ATTCGGCTAT GACTGGGCAC AACAGACAAT 5100 

CGGCTGCTCT GATGCCGCCG TGTTCCGGCT GTCAGCGCAG GGGCGCCCGG TTCTTTTTGT 516 0 

CAAGACCGAC CTGTCCGGTG CCCTGAATGA ACTGCAGGAC GAGGCAGCGC GGCTATCGTG 5220 

10 GCTGGCCACG ACGGGCGTTC CTTGCGCAGC TGTGCTCGAC GTTGTCACTG AAGCGGGAAG 52 8 0 

GGACTGGCTG CTATTGGGCG AAGTGCCGGG GCAGG ATCTC CTGTCATCTC ACCTTGCTCC 534 0 

TGCCGAGAAA GTATCCATCA TGGCTGATGC AATGCGGCGG CTGCATACGC TTGATCCGGC 54 0 0 

TACCTGCCCA TTCGACCACC AAGCGAAACA TCGCATCGAG CGAGCACGTA CTCGGATGG A 54 6 0 

AGCCGGTCTT GTCGATCAGG ATGATCTGGA CGAAGAGCAT CAGGGGCTCG CGCCAGCCGA 5 52 0 

15 ACTGTTCGCC AGGCTCAAGG CGCGCATGCC CGACGGCGAG GATCTCGTCG TGACCCATGG S58 0 

CGATGCCTGC TTGCCGAATA TCATGGTGGA AAATGGCCGC TTTTCTGGAT TCATCGACTG 564 0 

TGGCCGGCTG GGTGTGGCGG ACCGCTATCA GGACATAGCG TTGGCTACCC GTGATATTGC 5 70C 

TGAAGAGCTT GGCGGCGAAT GGGCTGACCG CTTCCTCGTG CTTTACGGTA TCGCCGCTCC 5 76 0 

CGATTCGCAG CGCATCGCCT TCTATCGCCT TCTTGACGAG TTCTTCTGAG CGGGACTCTG 582 0 

20 GGGTT 5825 

(2) INFORMATION FOR SEQ ID NO : 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4023 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(lii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE : . 

(A) ORGANISM: Influenza virus, RNA sequence 
(C) INDIVIDUAL ISOLATE: pHL1490 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 

5 CAGGTACATA TTGAAAGATG AGTCTTCTAA CCGAGGTCGA AACGTACGTT CTCTCTATCA 6 0 

TCCCGTCAGG CCCCCTCAAA GCCGAGATCG CACAGAGACT TGAAGATGTC TTTGCAGGGA 12 0 

AGAACACCGA TCTTGAGGTT CTCATGGAAT GGCTAAAGAC AAGACCAATC CTGTCACCTC 18 0 

TGACTAAGGG GATTTTAGGA TTTGTGTTCA CGCTCACCGT GCCCAGTGAG CGAGGACTGC 24 0 

AGCGTAGACG CTTTGTCCAA AATGCCCTTA ATGGGAACGG GG AT CC AAA T AACATGGACA 3 00 

10 AAGCAGTTAA ACTGTATAGG AAGCTCAAGA GGGAGATAAC ATTCCATGGG G C C A AAG AAA 3 60 

TCTCACTCAG TTATTCTGCT GGTGCACTTG CCAGTTGTAT GGGCCTCATA TACAACAGGA 4 20 

TGGGGGCTGT GACCACTGAA GTGGCATTTG GCCTGGTATG TGCAACCTGT GAACAGATTG 4 80 

CTGACTCCCA GCATCGGTCT CATAGGCAAA TGGTGACAAC AACCAACCCA CTAATCAGAC 54 0 

ATGAGAACAG AATGGTTTTA GCCAGCACTA CAGCTAAGGC TATGGAGCAA ATGGCTGGAT 6 00 

15 CGAGTGAGCA AGCAGCAGAG GCCATGGAGG TTGCTAGTCA GGCTAGGCAA ATGGTGCAAG 660 

CGATGAGAAC CATTGGGACT CATCCTAGCT CCAGTGCTGG TCTGAAAAAT GCTCTTCTTG 72 0 

AAAATTTGCA GGCCTATCAG AAACGAATGG GGGTGCAGAT GCAACGGTTC AAGTGATCCT 78 0 

CTCGCTATTG CCGCAAATAT CATTGGGATC TTGCACTTGA TATTGTGGAT TCTTG ATCGT 84 0 

CTTTTTTTCA AATGCATTTA CCGTCGCTTT AAATACGGAC TGAAAGGAGG GCCTTCTACG 90 0 

20 GAAGGAGTGC CAAAGTCTAT GAGGGAAGAA TATCGAAAGG AACAGCAGAG TGCTGTGGAT 96 0 

GCTGACGATG GTCATTTTGT CAGTATAGAG CTGGAGTAAA AAAGTACCTT GTTTCTACTA 102 0 

CCTATCTCCA GGTCCAATAG GACCAGATCT AAAAGATCAC AAGCATAAAA GAGACAGGGA 108 0 

GGAAAGTGAC AGGCCACAGA GAATACCTGG AAGTCATACC TGGGGAGGTG GCCCAAAAAT 114 0 

GACCCCATAA CGAAAAGAAC CGGACCTCAA AGGAACAACT GGTCGACCTG ACAACCCGGA 120 0 

25 GAACTGATAA GACCGACAGG TCAATGAAAG AAAACCGCGC CCATAGCTTT CCCTCGGCCT 126 0 

CAGACGTACA GGGCCCGCTA ACAGCGCGAT TTGCTGGTGA CCCAATGCGA CCAGATGCTC 132 0 

CACGCCCAGT CGCGTACCGT CTTCATGGGA GAAAATAATA CTGTTGATGG GTGTCTGGTC 13 8 0 
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AG AG AC AT C A AGAAATAACG CCGGAACATT 

CTGGTCATCC AGCGGATAGT TAATGATCTA 

CGGCACCTCG CTAACGGATT CACCACTCCA 

ACTGTGAATG CGCAAACCAA CCCTTGGCAG 

5 AGCCGCACGC GGCGCATCTC GGGCCGCGTT 

TGACGAGCAT CACAAAAATC GACGCTCAAG 

AAGATACCAG GCGTTTCCCC CTGGAAGCTC 

GCTTACCGGA TACCTGTCCG CCTTTCTCCC 

ACGCTGTAGG TATCTCAGTT CGGTGTAGGT 

10 ACCCCCCGTT CAGCCCGACC GCTGCGCCTT 

GGTAAGACAC GACTTATCGC CACTGGCAGC 

GTATGTAGGC GGTGCTACAG AGTTCTTGAA 

GACAGTATTT GGTATCTGCG CTCTGCTGAA 

CTCTTGATCC GGCAAACAAA CCACCGCTGG 

15 GATTACGCGC AGAAAAAAAG GATCTCAAGA 

CGCTCAGTGG AACGAAAACT CACGTTAAGG 

CTTCACCTAG ATCCTTTTAA ATTAAAAATG 

GTAAACTTGG TCTGACAGTT ACCAATGCTT 

TCTATTTCGT TCATCCATAG TTGCCTGACT 

20 GGGCTTACCA TCTGGCCCCA GTGCTGCAAT 

AGATTTATCA GCAATAAACC AGCCAGCCGG 

TTTATCCGCC TCCATCCAGT CTATTAATTG 

AGTTAATAGT TTGCGCAACG TTGTTGCCAT 

GTTTGGTATG GCTTCATTCA GCTCCGGTTC 

25 CATGTTGTGC AAAAAAGCGG TTAGCTCCTT 

GGCCGCAGTG TTATCACTCA TGGTTATGGC 

ATCCGTAAGA TGCTTTTCTG TGACTGGTGA 
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AGTGCAGGCA GCTTCCACAG CAATGGCATC 144 0 

CCGCGGTAGA TCTAGCCCAC TGAACGCGGG 150 0 

AGAATTGGAG CCAATCAATT CTTGCGGAGA 156 0 

AACATATCCA TCGCGTCCGC CATCTCCAGC 162 0 

GCTGGCGTTT TTCCATAGGC TCCGCCCCCC 16 6 0 

TCAGAGGTGG CGAAACCCGA CAGGACTATA 174 0 

CCTCGTGCGC TCTCCTGTTC CGACCCTGCC 1800 

TTCGGGAAGC GTGGCGCTTT CTCAATGCTC 186 0 

CGTTCGCTCC AAGCTGGGCT GTGTGCACGA 192 0 

ATCCGGTAAC TATCGTCTTG AGTCCAACCC 198 0 

AGCCACTGGT AACAGGATTA GCAGAGCGAG 2 04 0 

GTGGTGGCCT AACTACGGCT ACACTAGAAG 2100 

GCCAGTTACC TTCGGAAAAA GAGTTGGTAG 216 0 

TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA 222 0 

AGATCCTTTG ATCTTTTCTA CGGGGTCTGA 22 8 0 

GATTTTGGTC ATGAGATTAT CAAAAAGGAT 2 34 0 

AAGTTTTAAA TCAATCTAAA GTATATATGA 24 0 0 

AATCAGTGAG GCACCTATCT CAGCGATCTG 24 6 0 

CCCCGTCGTG TAGATAACTA CG AT ACGGG A 2 52 0 

GATACCGCGA GACCCACGCT CACCGGCTCC 2580 

AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC 2 64 0 

TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC 2 70 0 

TGCTGCAGGC ATCGTGGTGT CACGCTCGTC 2760 

CCAACGATCA AGGCGAGTTA CATGATCCCC 2 82 0 

CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT 2880 

AGCACTGCAT AATTCTCTTA CTGTCATGCC 2 94 0 

GTACTCAACC AAGTCATTCT GAGAATAGTG 3 0 00 
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TATGCGGCGA CCGAGTTGCT CTTGCCCGGC GTCAACACGG GATAATACCG CGCCACATAG 

CAGAACTTTA AAAGTGCTCA TCATTGGAAA ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT 

CTTACCGCTG TTGAGATCCA GTTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC 

ATCTTTTACT TTCACCAGCG TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA 

AAAGGGAATA AGGGCGACAC GGAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA 

TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA 

AAATAAACAA AAGAGTTTGT AGAAACGCAA AAAGGCCATC CGTCAGGATG GCCTTCTGCT 

TAATTTGATG CCTGGCAGTT TATGGCGGGC GTCCTGCCCG CCACCCTCCG GGCCGTTGCT 

TCGCAACGTT CAAATCCGCT CCCGGCGGAT TTGTCCTACT CAGGAGAGCG TTCACCGACA 

AACAACAGAT AAAACGAAAG GCCCAGTCTT TCGACTGAGC CTTTCGTTTT ATTTGATGCC 

TGGCAGTTCC CTACTCTCGC ATGGGGAGAC CCCACACTAC CATCGGCGCT ACGGCGTTTC 

ACTTCTGAGT TCGGCATGGG GTCAGGTGGG ACCACCGCGC TACTGCCGCC AGGCAAATTC 

TGTTTTATCA GACCGCTTCT GCGTTCTGAT TTAATCTGTA TCAGGCTGAA AATCTTCTCT 

CATCCGCCAA AACAGAAGCT AGCGGCCGCT AGCTTGGGCT GCAGGTCGAC TCTAGAGGAT 

CCCCAAAAAA AAAAAAAAAA AAAAAAAAAA GAGTCCAGAG TGGCCCCGCC GTTCCGCGCC 

GGGGGGGGGG GGGGGGGGGG ACACTTTCGG ACATCTGGTC GACCTCCAGC ATCGGGGGAA 

AAAAAAAAAA CAAAGTTTCG CCCGG AGTAC TGGTCGACCT CCGAAGTTGG GGGGGAGTAG 
AAA 



306 0 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
36 00 
366 0 
3720 
3780 
384 0 

3 900 
3960 
4020 

4 023 
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CLAIMS: 



5 

2. 
3. 

10 4 

5. 

15 6. 

7. 
8 

20 



A segmented RNA virus characterized in that it comprises one or more 
segments which have been genetically modified to show improved 
transcription, replication and/or expression rates and which may contain 
non viral genes to be transferred to the host. 

The virus of claim 1, wherein one or more modifications have been 
introduced in the noncoding region(s) and/or in the coding region(s). 

The virus of any of claims 1 or 2, wherein at least one modified segment is 
derived from an original one by sequence variation(s) 

The virus of any of claims 1 or 2, wherein at least one modified segment is 
an artificial addition to the set of original segments 

The virus of claim 4, wherein the modified segment comprises a nucleotide 
sequence which codes for a protein or peptide which is foreign to the 
original virus. 

The virus of claim 5, wherein the foreign protein or peptide constitutes an 
antigen or antigen-like sequence, a T-cell epitope or related sequence 

The virus of any of claims 5 or 6, wherein the segment comprises 
repetitions of an antigen or epitope or other peptide or protein. 

The virus of any of claims 7 to 9, wherein the antigen or epitope is derived 
from HIV, Herpes-Virus, Rhinovirus, CMV, papilloma viruses, Hepatitis 
viruses and other human viruses or Hog Cholera Virus. 

A pharmaceutical preparation of a vaccine comprising the virus of any of 
claims 1 to 8. 

Use of the virus of any of claims 1 to 8 for the preparation of pharma- 
ceuticals 
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